Discussion:
.Docx to .md raw text
Justus Caspers
2018-11-21 08:09:52 UTC
Permalink
Hi,

I'm trying to convert a .docx file to an .md file. The goal of the transformation is an exact conversion of the text from the Word file to the markdown file.

The Word file looks like:

---
indicator: 3.c.1
layout: indicator
indicator_sort_order: '0260'
permalink: /3-c-1/
sdg_goal: '3'
---

If I convert the file with pandoc, the result is:

\-\--
indicator: 3.c.1
layout: indicator
indicator\_sort\_order: \'0260\'
permalink: /3-c-1/
sdg\_goal: \'3\'
\-\--

Is there an option to prevent the addition of the backslashes?

Thx in advance for the help!
--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+***@googlegroups.com.
To post to this group, send email to pandoc-***@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/23321d26-d910-4052-b87e-2aeacf8c14ef%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
John MacFarlane
2018-11-22 03:45:40 UTC
Permalink
If you want markdown output, you need the backslash
escapes, since characters like - will otherwise be
interpreted in ways you don't intend. (E.g., in
pandoc markdown, --- will turn into an em dash.)

If you just want the text, have you tried `-t plain`?
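
A minimal sketch of the `-t plain` suggestion above, wrapped in a small shell helper. The helper name `docx_to_plain` and the file names are hypothetical, and `--wrap=none` is an addition not mentioned in the thread: it only keeps pandoc from re-wrapping long lines and can be dropped.

```shell
#!/bin/sh
# docx_to_plain INPUT OUTPUT
# Hypothetical helper: convert a Word file to plain text with pandoc.
# The plain writer emits the text without markdown backslash escapes,
# so lines such as "---" and "sdg_goal: '3'" are written as-is.
docx_to_plain() {
    # --to=plain   : the "-t plain" option suggested in the thread
    # --wrap=none  : assumption, not from the thread; disables line re-wrapping
    pandoc "$1" --from=docx --to=plain --wrap=none --output="$2"
}

# Example call (file names are placeholders):
# docx_to_plain indicator.docx indicator.md
```

Note that the result is plain text saved under an .md name, not pandoc markdown: any formatting in the Word file (bold, headings, links) is not carried over as markdown syntax.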
Justus Caspers
2018-11-22 08:40:39 UTC
Permalink
I hadn't tried it before, but it generates the result I was looking for.
Thx for the help