Releases: jgm/pandoc
pandoc 2.1.1
-
Markdown reader:
- Don’t coalesce adjacent raw LaTeX blocks if they are separated by a blank line. See lierdakil/pandoc-crossref#160.
- Improved
inlinesInBalancedBrackets
(#4272, jgm/pandoc-citeproc#315). The change both improves performance and fixes a regression whereby normal citations inside inline notes and figure captions were not parsed correctly.
-
RST reader:
- Better handling for headers with an anchor (#4240). Instead of creating a Div containing the header, we put the id directly on the header. This way header promotion will work properly.
- Add aligned environment when needed in math (#4254).
rst2latex.py
uses analign*
environment for math in.. math::
blocks, so this math may contain line breaks. If it does, we put the math in analigned
environment to simulaterst2latex.py
’s behavior.
-
HTML reader:
- Fix col width parsing for percentages < 10% (#4262, n3fariox).
-
LaTeX reader:
-
Advance source position at end of stream.
-
Pass through macro defs in
rawLaTeXBlock
even if thelatex_macros
extension is set (#4246). This reverts to earlier behavior and is probably safer on the whole, since some macros only modify things in included packages, which pandoc’s macro expansion can’t modify. -
Fixed pos calculation in tokenizing escaped space.
-
Allow macro definitions inside macros (#4253). Previously we went into an infinite loop with
\newcommand{\noop}[1]{#1} \noop{\newcommand{\foo}[1]{#1}} \foo{hi}
-
Fix inconsistent column widths (#4238). This fixes a bug whereby column widths for the body were different from widths for the header in some tables.
-
-
Docx reader (Jesse Rosenthal):
- Parse hyperlinks in
instrText
tags (#3389, #4266). This was a form of hyperlink found in older versions of word. The changes introduced for this, though, create a framework for parsing further fields in MS Word (see the spec, ECMA-376-1:2016, §17.16.5, for more on these fields). We introduce a new module,Text.Pandoc.Readers.Docx.Fields
which contains a simple parsec parser. At the moment, only simple hyperlink fields are accepted, but that can be extended in the future.
- Parse hyperlinks in
-
Muse reader (Alexander Krotov):
- Parse
~~
as non-breaking space in Text::Amuse mode. - Refactor list parsing.
- Parse
-
Powerpoint writer (Jesse Rosenthal):
- Change reference to
notesSlide
toendNotesSlide
. - Move image sizing into
picProps
. - Improve table placement.
- Make our own
_rels/.rels
file. - Import reference-doc images properly.
- Move
Presentation.hs
out ofPandocMonad
. - Refactor into separate modules. T.P.W.Powerpoint.Presentation defines the Presentation datatype and goes Pandoc->Presentation; T.P.W.Pandoc.Output goes Presentation->Archive. Text.Pandoc.Writers.Powerpoint a thin wrapper around the two modules.
- Avoid overlapping blocks in column output.
- Position images correctly in two-column layout.
- Make content shape retrieval environment-aware.
- Improve image handling. We now determine image and caption placement by getting the dimensions of the content box in a given layout. This allows for images to be correctly sized and positioned in a different template. Note that images without captions and headers are no longer full-screened. We can’t do this dependably in different layouts, because we don’t know where the header is (it could be to the side of the content, for example).
- Read presentation size from reference file. Our presentation size is now dependent on the reference/template file we use.
- Handle (sub)headers above slidelevel correctly. Above the slidelevel, subheaders will be printed in bold and given a bit of extra space before them. Note that at the moment, no distinction is made between levels of headers above the slide header, though that can be changed.
- Check for required files. Since we now import from reference/dist file by glob, we need to make sure that we’re getting the files we need to make a non-corrupt Powerpoint. This performs that check.
- Improve templating using
--reference-doc
. Templating should work much more reliably now. - Include Notes slide in TOC.
- Set notes slide header to slide-level.
- Add table of contents. This is triggered by the
--toc
flag. Note that in a long slide deck this risks overrunning the text box. The user can address this by setting--toc-depth=1
. - Set notes slide number correctly.
- Clean up adding metadata slide. We want to count the slide numbers correctly if it’s in there.
- Add anchor links. For anchor-type links (
[foo](#bar)
) we produce an anchor link. In powerpoint these are links to slides, so we keep track of a map relating anchors to the slides they occur on. - Make the slide number available to the blocks. For anchors, block-processing functions need to know what slide number they’re in. We make the
envCurSlideId
available to blocks. - Move
curSlideId
to environment. - Allow setting
toc-title
in metadata. - Link notes to endnotes slide.
- Change reference to
-
Markdown writer:
- Fix cell width calculation (#4265). Previously we could get ever-lengthening cell widths when a table was run repeatedly through
pandoc -f markdown -t markdown
.
- Fix cell width calculation (#4265). Previously we could get ever-lengthening cell widths when a table was run repeatedly through
-
LaTeX writer:
- Escape
&
in lstinline (Robert Schütz).
- Escape
-
ConTeXt writer:
- Use xtables instead of Tables (#4223, Henri Menke). Default to xtables for context output. Natural Tables are used if the new
ntb
extension is set.
- Use xtables instead of Tables (#4223, Henri Menke). Default to xtables for context output. Natural Tables are used if the new
-
HTML writer:
- Fixed footnote backlinks with
--id-prefix
(#4235).
- Fixed footnote backlinks with
-
Text.Pandoc.Extensions
: AddedExt_ntb
constructor (API change, Henri Menke). -
Text.Pandoc.ImageSize
: add derivedEq
instance toDimension
(Jesse Rosenthal, API change). -
Lua filters (Albert Krewinkel):
- Make
PANDOC_READER_OPTIONS
available. The options which were used to read the document are made available to Lua filters via thePANDOC_READER_OPTIONS
global. - Add lua module
pandoc.utils.run_json_filter
, which runs a JSON filter on a Pandoc document. - Refactor filter-handling code into
Text.Pandoc.Filter.JSON
,Text.Pandoc.Filter.Lua
, andText.Pandoc.Filter.Path
. - Improve error messages. Provide more context about the task which caused an error.
- Make
-
data/pandoc.lua (Albert Krewinkel):
- Accept singleton inline as a list. Every constructor which accepts a list of inlines now also accepts a single inline element for convenience.
- Accept single block as singleton list. Every constructor which accepts a list of blocks now also accepts a single block element for convenience. Furthermore, strings are accepted as shorthand for
{pandoc.Str "text"}
in constructors. - Add attr, listAttributes accessors. Elements with attributes got an additional
attr
accessor. Attributes were accessible only via theidentifier
,classes
, andattributes
, which was in conflict with the documentation, which indirectly states that such elements have the anattr
property. - Drop
_VERSION
. Having a_VERSION
became superfluous, as this module is closely tied to the pandoc version, which is available viaPANDOC_VERSION
. - Fix access to Attr components. Accessing an Attr value (e.g.,
Attr().classes
) was broken; the more common case of accessing it via an Inline or Block element was unaffected by this.
-
Move
metaValueToInlines
to from Docx writer toText.Pandoc.Writers.Shared
, so it can be used by other writers (Jesse Rosenthal). -
MANUAL.txt:
-
Allow latest QuickCheck, tasty, criterion.
-
Remove custom prelude and ghc 7.8 support.
-
Reduce compiler noise (exact paths for compiled modules).
pandoc 2.1
-
Allow filters and lua filters to be interspersed (#4196). Previously we ran all lua filters before JSON filters. Now we run filters in the order they are presented on the command line, whether lua or JSON. There are two incompatible API changes: The type of
applyFilters
has changed, andapplyLuaFilters
has been removed.Filter
is also now exported. -
Use latest skylighting and omit the
missingIncludes
check, fixing a major performance regression in earlier releases of the 2.x series (#4226). Behavior change: If you use a custom syntax definition that refers to a syntax you haven’t loaded, pandoc will now complain when it is highlighting the text, rather than doing a check at the start. This change dramatically speeds up invocations of pandoc on short inputs. -
Text.Pandoc.Class: make
FileTree
opaque (don’t exportFileTree
constructor). This forces users to interact with it usinginsertInFileTree
andgetFileInfo
, which normalize file names. -
Markdown reader:
- Rewrite
inlinesInBalancedBrackets
. The rewrite is much more direct, avoidingparseFromString
. And it performs significantly better; unfortunately, parsing time still increases exponentially (see #1735). - Avoid parsing raw tex unless
\
+ letter seen. This seems to help with the performance problem, #4216.
- Rewrite
-
LaTeX reader: Simplified a check for raw tex command.
-
Muse reader (Alexander Krotov):
- Enable round trip test (#4107).
- Automatically translate
#cover
into#cover-image
. Amusewiki uses #cover directive to specify cover image.
-
Docx reader (Jesse Rosenthal):
- Allow for insertion/deletion of paragraphs (#3927). If the paragraph has a deleted or inserted paragraph break (depending on the track-changes setting) we hold onto it until the next paragraph. This takes care of accept and reject. For this we introduce a new state which holds the ils from the previous para if necessary. For
--track-changes=all
, we add an empty span with classparagraph-insertion
/paragraph-deletion
at the end of the paragraph prior to the break to be inserted or deleted. - Remove unused anchors (#3679). Docx produces a lot of anchors with nothing pointing to them—we now remove these to produce cleaner output. Note that this has to occur at the end of the process because it has to follow link/anchor rewriting.
- Read multiple children of
w:sdtContents
. - Combine adjacent anchors. There isn’t any reason to have numerous anchors in the same place, since we can’t maintain docx’s non-nesting overlapping. So we reduce to a single anchor.
- Improved tests.
- Allow for insertion/deletion of paragraphs (#3927). If the paragraph has a deleted or inserted paragraph break (depending on the track-changes setting) we hold onto it until the next paragraph. This takes care of accept and reject. For this we introduce a new state which holds the ils from the previous para if necessary. For
-
Muse writer (Alexander Krotov): don’t escape URIs from AST
-
Docx writer:
- Removed redundant subtitle in title (Sebastian Talmon).
firstRow
table definition compatibility for Word 2016 (Sebastian Talmon). Word 2016 seems to use a default value of “1” for table headers, if there is no firstRow definition (although a default value of 0 is documented), so all tables get the first Row formatted as header. Setting the parameter to 0 if the table has no header row fixes this for Word 2016- Fix custom styles with spaces in the name (#3290).
-
Powerpoint writer (Jesse Rosenthal):
- Ignore Notes div for parity with other slide outputs.
- Set default slidelevel correctly. We had previously defaulted to slideLevel 2. Now we use the correct behavior of defaulting to the highest level header followed by content. We change an expected test result to match this behavior.
- Split blocks correctly for linked images.
- Combine adjacent runs.
- Make inline code inherit code size. Previously (a) the code size wasn’t set when we force size, and (b) the properties was set from the default, instead of inheriting.
- Simplify
replaceNamedChildren
function. - Allow linked images. The following markdown:
[![Image Title](image.jpg)](http://www.example.com)
will now produce a linked image in the resulting PowerPoint file. - Fix error with empty table cell. We require an empty
<a:p>
tag, even if the cell contains no paragraphs—otherwise PowerPoint complains of corruption. - Implement two-column slides. This uses the columns/column div format described in the pandoc manual. At the moment, only two columns (half the screen each) are allowed. Custom widths are not supported.
- Added more tests.
-
OpenDocument/ODT writers: improved rendering of formulas (#4170, oltolm).
-
Lua filters (Albert Krewinkel):
-
data/pandoc.lua
: drop ‘pandoc-api-version’ from Pandoc objects -
The current pandoc-types version is made available to Lua programs in the global
PANDOC_API_VERSION
. It contains the version as a list of numbers. -
The pandoc version available as a global
PANDOC_VERSION
(a list of numbers). -
data/pandoc.lua
: makeAttr
anAstElement
. -
data/pandoc.lua
: make all types subtypes ofAstElement
.Pandoc
,Meta
, andCitation
were just plain functions and did not set a metatable on the returned value, which made it difficult to amend objects of these types with new behavior. They are now subtypes of AstElement, meaning that all their objects can gain new features when a method is added to the behavior object (e.g.,pandoc.Pandoc.behavior
). -
data/pandoc.lua
: split type and behavior tables. Clearly distinguish between a type and the behavioral properties of an instance of that type. The behavior of a type (and all its subtypes) can now be amended by adding methods to that typesbehavior
object, without exposing the type objects internals. E.g.:pandoc.Inline.behavior.frob = function () print'42' end local str = pandoc.Str'hello' str.frob() -- outputs '42'
-
data/pandoc.lua
: fix Element inheritance. Extending all elements of a given type (e.g., all inline elements) was difficult, as the table used to lookup unknown methods would be reset every time a new element of that type was created, preventing recursive property lookup. This is was changed in that all methods and attributes of supertypes are now available to their subtypes. -
data/pandoc.lua
: fix attribute names of Citation (#4222). The fields were named like the Haskell fields, not like the documented, shorter version. The names are changed to match the documentation and Citations are given a shared metatable to enable simple extensibility. -
data/pandoc.lua
: drop functionpandoc.global_filter
. -
Bump
hslua
version to 0.9.5. This version fixes a bug that made it difficult to handle failures while getting lists or a Map from Lua. A bug in pandoc, which made it necessary to always pass a tag when using MetaList or MetaBlock, is fixed as a result. Using the pandoc module’s constructor functions for these values is now optional (if still recommended). -
Stop exporting
pushPandocModule
(API change). The introduction ofrunPandocLua
renders direct use of this function obsolete. -
Update generation of module docs for lua filters.
-
Lua.Module.Utils
: make stringify work onMetaValues
(John MacFarlane). I’m sure this was intended in the first place, but currently onlyMeta
is supported.
-
-
Improve benchmarks.
- Set the default extensions properly.
- Improve benchmark argument parsing. You can now say
make bench BENCHARGS="markdown latex reader"
and both the markdown and latex readers will be benchmarked.
-
MANUAL.txt simplify and add more structure (Mauro Bieg).
-
Generate README.md from template and MANUAL.txt.
make README.md
will generate the README.md after changes to MANUAL.txt have been made. -
Update copyright notices to include 2018 (Albert Krewinkel).
pandoc 2.0.6
-
Added
jats
as an input format.- Add Text.Pandoc.Readers.JATS, exporting
readJATS
(API change) (Hamish Mackenzie). - Improved citation handling in JATS reader. JATS citations are now converted to pandoc citations, and JATS ref-lists are converted into a
references
field in metadata, suitable for use with pandoc-citeproc. Thus a JATS article with embedded bibliographic information can be processed with pandoc and pandoc-citeproc to produce a formatted bibliography.
- Add Text.Pandoc.Readers.JATS, exporting
-
Allow
--list-extensions
to take an optional FORMAT argument. This lists the extensions set by default for the selected FORMAT. The extensions are now alphabetized, and the+
or-
indicating the default setting comes before, rather than after, the extension. -
Markdown reader:
- Preserve original whitespace between blocks.
- Recognize
\placeformula
as context. - Be pickier about table captions. A caption starts with a
:
which can’t be followed by punctuation. Otherwise we can falsely interpret the start of a fenced div, or even a table header line like:--:|:--:
, as a caption. - Always use four space rule for example lists. It would be awkward to indent example list contents to the first non-space character after the label, since example list labels are often long. Thanks to Bernhard Fisseni for the suggestion.
- Improve raw tex parsing. Note that the Markdown reader is also affected by the
latex_macros
extension changes described below under the LaTeX reader.
-
LaTeX reader:
latex_macros
extension changes (#4179). Don’t pass through macro definitions themselves whenlatex_macros
is set. The macros have already been applied. Iflatex_macros
is enabled, thenrawLaTeXBlock
in Text.Pandoc.Readers.LaTeX will succeed in parsing a macro definition, and will update pandoc’s internal macro map accordingly, but the empty string will be returned.- Export
tokenize
,untokenize
(API change). - Use
applyMacros
inrawLaTeXBlock
,rawLaTeXInline
. - Refactored
inlineCommand
. - Fix bug in tokenizer. Material following
^^
was dropped if it wasn’t a character escape. This only affected invalid LaTeX, so we didn’t see it in the wild, but it appeared in a QuickCheck test failure. - Fix regression in LateX tokenization (#4159). This mainly affects the Markdown reader when parsing raw LaTeX with escaped spaces.
- Add tests of LaTeX tokenizer.
- Support
\foreignlanguage
from babel. - Be more tolerant of
&
character (#4208). This allows us to parse unknown tabular environments as raw LaTeX.
-
Muse reader (Alexander Krotov):
- Parse anchors immediately after headings as IDs.
- Require that note references does not start with 0.
- Parse empty comments correctly.
-
Org reader (Albert Krewinkel):
- Fix asterisks-related parsing error (#4180).
- Support minlevel option for includes (#4154). The level of headers in included files can be shifted to a higher level by specifying a minimum header level via the
:minlevel
parameter. E.g.#+include: "tour.org" :minlevel 1
will shift the headers in tour.org such that the topmost headers become level 1 headers. - Break-up org reader test file into multiple modules.
-
OPML reader:
- Enable raw HTML and other extensions by default for notes (#4164). This fixes a regression in 2.0. Note that extensions can now be individually disabled, e.g.
-f opml-smart-raw_html
.
- Enable raw HTML and other extensions by default for notes (#4164). This fixes a regression in 2.0. Note that extensions can now be individually disabled, e.g.
-
RST reader:
-
Docx reader:
- Continue lists after interruption (#4025, Jesse Rosenthal). Docx expects that lists will continue where they left off after an interruption and introduces a new id if a list is starting again. So we keep track of the state of lists and use them to define a “start” attribute, if necessary.
- Add tests for structured document tags unwrapping (Jesse Rosenthal).
- Preprocess Document body to unwrap
w:sdt
elements (Jesse Rosenthal, #4190).
-
Plain writer:
- Don’t linkify table of contents.
-
RST writer:
- Fix anchors for headers (#4188). We were missing an
_
.
- Fix anchors for headers (#4188). We were missing an
-
PowerPoint writer (Jesse Rosenthal):
- Treat lists inside BlockQuotes as lists. We don’t yet produce incremental lists in PowerPoint, but we should at least treat lists inside BlockQuotes as lists, for compatibility with other slide formats.
- Add ability to force size. This replaces the more specific
blockQuote runProp
, which only affected the size of blockquotes. We can use this for notes, etc. - Implement notes. This currently prints all notes on a final slide. Note that at the moment, there is a danger of text overflowing the note slide, since there is no logic for adding further slides.
- Implement basic definition list functionality to PowerPoint writer.
- Don’t look for default template file for Powerpoint (#4181).
- Add pptx to isTextFormat list. This is used to check standalone and not writing to the terminal.
- Obey slide level option (Jesse Rosenthal).
- Introduce tests.
-
Docx writer:
- Ensure that
distArchive
is the one that comes with pandoc (#4182). Previously areference.docx
in~/.pandoc
(or the user data dir) would be used instead, and this could cause problems because a user-modified docx sometimes lacks vital sections that we count on thedistArchive
to supply.
- Ensure that
-
Org writer:
- Do not wrap “-” to avoid accidental bullet lists (Alexander Krotov).
- Don’t allow fn refs to wrap to beginning of line (#4171, with help from Alexander Krotov). Otherwise they can be interpreted as footnote definitions.
-
Muse writer (Alexander Krotov):
- Don’t wrap note references to the next line (#4172).
-
HTML writer:
- Use br elements in line blocks instead of relying on CSS (#4162). HTML-based templates have had the custom CSS for
div.line-block
removed. Those maintaining custom templates will want to remove this too. We still enclose line blocks in a div with classline-block
.
- Use br elements in line blocks instead of relying on CSS (#4162). HTML-based templates have had the custom CSS for
-
LaTeX writer:
-
FB2 writer:
- Add cover image specified by
cover-image
meta (Alexander Krotov, #4195).
- Add cover image specified by
-
JATS writer (Hamish Mackenzie):
- Support writing
<fig>
and<table-wrap>
elements with<title>
and<caption>
inside them by using Divs with class set to one offig
,table-wrap
orcaption
(Hamish Mackenzie). The title is included as a Heading so the constraint on where Heading can occur is also relaxed. - Leave out empty alt attributes on links.
- Deduplicate image mime type code.
- Make
<p>
optional in<td>
and<th>
(#4178). - Self closing tags for empty xref (#4187).
- Improve support for code language.
- Support writing
-
Custom writer:
- Use init file to setup Lua interpreter (Albert Krewinkel). The same init file (
data/init
) that is used to setup the Lua interpreter for Lua filters is also used to setup the interpreter of custom writers.lua. - Define instances for newtype wrapper (Albert Krewinkel). The custom writer used its own
ToLuaStack
instance definitions, which made it difficult to share code with Lua filters, as this could result in conflicting instances. AStringify
wrapper is introduced to avoid this problem. - Added tests for custom writer.
- Fixed definition lists and tables in
data/sample.lua
.
- Use init file to setup Lua interpreter (Albert Krewinkel). The same init file (
-
Fixed regression: when target is PDF, writer extensions were being ignored. So, for example,
pandoc -t latex-smart -o file.pdf
did not work properly. -
Lua modules (Albert Krewinkel):
- Add
pandoc.utils
module, to hold utility functions. - Create a Haskell module Text.Pandoc.Lua.Module.Pandoc to define the
pandoc
lua module. - Make a Haskell module for each Lua module. Move definitions for the
pandoc.mediabag
modules to a separate Haskell module. - Move
sha1
from the mainpandoc
module topandoc.utils
. - Add function
pandoc.utils.hierarchicalize
(convert list of Pandoc blocks into (hierarchical) list of Elements). - Add function
pandoc.utils.normalize_date
(parses a date and converts it (if possible) to “YYYY-MM-DD” format). - Add function
pandoc.utils.to_roman_numeral
(allows conversion of numbers below 4000 into roman numerals). - Add function
pandoc.utils.stringify
(converts any AST element to a string with formatting removed). data/init.lua
: loadpandoc.utils
by default- Turn pipe, read into full Haskell functions. The
pipe
andread
utility functions are converted from hybrid lua/haskell functions into full Haskell functions. This avoids the need for intermediate_pipe
/_read
helper functions, which have dropped. - pandoc.lua: re-add missing MetaMap function. This was a bug introduced in version 2.0.4.
- Add
-
Text.Pandoc.Class: Add
insertInFileTree
[API change]. This gives a pure way to insert an ersatz file into aFileTree
. In addition, we normalize paths both on insertion and on lookup. -
Text.Pandoc.Shared: export
blocksToInlines'
(API change, Maura Bieg). -
Text.Pandoc.MIME: Add opus to MIME type table as audio/ogg (#4198)....
pandoc 2.0.5
-
Fix a bug in 2.0.4, whereby pandoc could not read the theme files
generated with--print-highlight-style
(#4133). Improve JSON
serialization of styles. -
Fix CSS issues involving line numbers (#4128).
Highlighted code blocks are now enclosed in a div with classsourceCode
.
Highlighting CSS no longer sets a generic color for pre and code; we only
set these for classsourceCode
. -
--pdf-engine-opt
: fix bug where option order was reversed (#4137). -
Add PowerPoint (pptx) writer (Jesse Rosenthal).
It works following the standard Pandoc conventions for making other
sorts of slides. Caveats:- Syntax highlighting is not yet implemented. (This is difficult
because there are no character classes in Powerpoint.) - Footnotes and Definition lists are not yet implemented. (Notes will
usually take the form of a final slide. - Image placement and auto-resizing has a few glitches.
- Reference powerpoint files don’t work dependably from the command
line. This will be implemented, but at the moment users are advised
to change themes from within Powerpoint.
- Syntax highlighting is not yet implemented. (This is difficult
-
Create shared Text.Pandoc.Writers.OOXML module (Jesse Rosenthal).
This is for functions used by both Powerpoint and Docx writers. -
Add default pptx data for Powerpoint writer (Jesse Rosenthal).
-
Add
empty_paragraphs
extension.- Deprecate
--strip-empty-paragraphs
option. Instead we now
use anempty_paragraphs
extension that can be enabled on
the reader or writer. By default, disabled. - Add
Ext_empty_paragraphs
constructor toExtension
. - Revert “Docx reader: don’t strip out empty paragraphs.”
This reverts commit d6c58eb. - Implement
empty_paragraphs
extension in docx reader and writer,
opendocument writer, html reader and writer. - Add tests for
empty_paragraphs
extension.
- Deprecate
-
Markdown reader:
- Don’t parse native div as table caption (#4119).
- Improved computation of column widths in pipe tables.
Pipe tables with lines longer than the text width (as set
by--columns
) are now scaled to text width, with the relative
widths of columns determined by the ratios between the
header lines. Previously we computed column widths using
the ratio of header line lengths to column width, so that
tables with narrow header lines were extremely thin, which
was very rarely the desired result.
-
LaTeX reader: fix
\
before newline (#4134). This should be a space,
as long as it’s not followed by a blank line. This has been fixed at the
tokenizer level. -
Muse reader (Alexander Krotov):
- Add test for
#disable-tables
directive in Emacs mode. - Don’t allow emphasis to be preceded by letter.
- Add underline support in Emacs Muse mode..
- Support multiline directives in Amusewiki mode
- Add test for
-
Man writer: omit internal links (#4136). That is, just print the link
text without the URL. -
Markdown reader: accept processing instructions as raw HTML (#4125).
-
Lua filters (Albert Krewinkel):
- Use script to initialize the interpreter. The file
init.lua
is
used to initialize the Lua interpreter which is used in Lua filters.
This gives users the option to require libraries which they want to
use in all of their filters, and to extend default modules. - Fix package loading for Lua 5.1. The list of package searchers is
namedpackage.loaders
in Lua 5.1 and LuaJIT, andpackage.searchers
in Lua 5.2 and later. - Refactor lua module handling. The integration with Lua’s package/module
system is improved: A pandoc-specific package searcher is prepended to
the searchers inpackage.searchers
. The modulespandoc
and
pandoc.mediabag
can now be loaded viarequire
. - Bump lower bound of hslua. The release hslua 0.9.3 contains a new
function which makes using Haskell functions as package loaders much
easier.
- Use script to initialize the interpreter. The file
-
reveal.js template: add title-slide identifier to title slide (#4120).
This allows it to be styled more easily. -
LaTeX template: Added support for
pagestyle
variable (#4135,
Thomas Hodgson) -
Add
-threaded
to ghc-options for executable (#4130, fixes a build
error on linux).
pandoc 2.0.4
-
Add
--print-highlight-style
option. This generates a JSON version
of a highlighting style, which can be saved as a.theme
file, modified,
and used with--highlight-style
(#4106, #4096). -
Add
--strip-empty-paragraphs
option. This works for any input format.
It is primarily intended for use with docx and odt documents where
empty paragraphs have been used for inter-paragraph spaces. -
Support
--webtex
forgfm
output. -
Recognize
.muse
file extension. -
Support beamer
\alert
in LaTeX reader. Closes #4091. -
Docx reader: don’t strip out empty paragraphs (#2252).
Users who have a conversion pipeline from docx may want to consider adding
--strip-empty-paragraphs
to the command line. -
Org reader (Albert Krewinkel): Allow empty list items (#4090).
-
Muse reader (Alexander Krotov):
- Parse markup in definition list terms.
- Allow definition to end with EOF.
- Make code blocks round trip.
- Drop common space prefix from list items.
- Add partial round trip test.
- Don’t interpret XML entities.
- Remove
nested
. - Parse
~~
as non-breaking space in Emacs mode. - Correctly remove indentation from notes. Exactly one space is
required and considered to be part of the marker. - Allow list items to be empty.
- Add ordered list test.
- Add more multiline definition tests.
- Don’t allow blockquotes within lists.
- Fix reading of multiline definitions.
- Add inline
<literal>
support. - Concatenate inlines of the same type
-
Docx writer: allow empty paragraphs (#2252).
-
CommonMark/gfm writer:
- Use raw html for native divs/spans (#4113). This allows a pandoc
markdown native div or span to be rendered in gfm using raw html tags. - Implement
raw_html
andraw_tex
extensions. Note thatraw_html
is enabled by default forgfm
, whileraw_tex
is disabled by default.
- Use raw html for native divs/spans (#4113). This allows a pandoc
-
Muse writer (Alexander Krotov):
- Test that inline math conversion result is normalized.
Without normalization this test produced
<em>a</em><em>b</em><em>c</em>
. - Improve inline list normalization and move to writer.
- Escape hash symbol.
- Escape
----
to avoid accidental horizontal rules. - Escape only
</code>
inside code tag. - Additional
<verbatim>
is not needed as<code>
is verbatim already.
- Test that inline math conversion result is normalized.
-
LaTeX writer:
- Allow specifying just width or height for image size.
Previously both needed to be specified (unless the image was
being resized to be smaller than its original size).
If height but not width is specified, we now set width to
textwidth. If width but not height is specified, we now set
height to textheight. Since we havekeepaspectratio
, this
yields the desired result. - Escape
~
and_
in code with--listings
(#4111).
- Allow specifying just width or height for image size.
-
HTML writer: export
tagWithAttributes
. This is a helper allowing
other writers to create single HTML tags. -
Let papersizes
a0
,a1
,a2
, … be case-insensitive by
converting the case as needed in LaTeX and ConTeXt writers. -
Change
fixDisplayMath
fromText.Pandoc.Writers.Shared
so that it no longer produces emptyPara
’s as an artifact. -
Text.Pandoc.Shared.blocksToInlines
: rewrote using builder.
This gives us automatic normalization, so we don’t get
for example two consecutive Spaces. -
Include default CSS for ‘underline’ class in HTML-based templates.
-
revealjs template: add
tex2jax
configuration for the
math plugin. With the next release of reveal.js, this will
fix the problem of$
s outside of math contexts being
interpreted as math delimiters (#4027). -
pandoc.lua
module for use in lua filters (Albert Krewinkel):-
Add basic lua List module (#4099, #4081). The List module is
automatically loaded, but not assigned to a global variable. It can be
included in filters by callingList = require 'List'
. Lists of blocks,
lists of inlines, and lists of classes are now givenList
as a metatable,
making working with them more convenient. E.g., it is now possible to
concatenate lists of inlines using Lua’s concatenation operator..
(requires at least one of the operants to haveList
as a metatable):function Emph (emph) local s = {pandoc.Space(), pandoc.Str 'emphasized'} return pandoc.Span(emph.content .. s) end
The
List
metatable is assigned to the tables which get passed to
the constructorsMetaBlocks
,MetaInline
, andMetaList
. This
enables the use of the resulting objects as lists. -
Lua/StackInstances
: push Pandoc and Meta via constructor.
Pandoc and Meta elements are now pushed by calling the respective
constructor functions of the pandoc Lua module. This makes serialization
consistent with the way blocks and inlines are pushed to lua and allows
to use List methods with theblocks
value. -
Add documentation for pandoc.List in
lua-filters.md
.
-
-
Use latest tagsoup. This fixes a bug in parsing HTML tags with
&
(but not a valid entity) following them (#4094, #4088). -
Use skylighting 0.4.4.1, fixing the color of unmarked code text
whennumberLines
is used (#4103). -
Make
normalizeDate
more forgiving (Mauro Bieg, #4101), not
requiring a leading 0 on single-digit days. -
Fix
--help
output for--highlight-style
to includeFILE
(Mauro
Bieg, #4095). -
Clearer deprecation warning for
--latexmathml, --asciimathml, -m
.
Previously we only mentioned--latexmathml
, even if-m
was
used. -
Changelog: fix description of lua filters in 2.0 release
(Albert Krewinkel). Lua filters were initially run after conventional
(JSON) filters. However, this was changed later to make it easier to deal
with files in the mediabag. The changelog is updated to describe that
feature of the 2.0 release correctly. -
Change Generic JSON instances to TemplateHaskell (Jasper Van der Jeugt,
#4085). This reduces compile time and memory usage significantly. -
lua-filters.md
: Added tikz filter example. -
Create alternative zip file for macOS binaries.
-
Create alternative zip file for Windows binaries.
-
Update INSTALL.md since we now provide zips for binaries.
-
Relax
http-types
dependency (Justus Sagemüller, #4084). -
Add
epub.md
,getting-started.md
to docs. These used to live in
the website repo. -
Add
packages
target to Makefile. -
Bump bounds for binary, http-types, tasty-hunit
pandoc 2.0.3
-
Lua filters: preload text module (Albert Krewinkel, #4077).
Thetext
module is preloaded in lua. The module contains some UTF-8
aware string functions, implemented in Haskell. The module is loaded on
request only, e.g.:text = require 'text' function Str (s) s.text = text.upper(s.text) return s end
-
Allow table-like access to attributes in lua filters (Albert Krewinkel,
#4071). Attribute lists are represented as associative lists in Lua. Pure
associative lists are awkward to work with. A metatable is attached to
attribute lists, allowing to access and use the associative list as if
the attributes were stored in as normal key-value pair in table.
Note that this changes the waypairs
works on attribute lists. Instead
of producing integer keys and two-element tables, the resulting iterator
function now returns the key and value of those pairs. Useipairs
to
get the old behavior. Warning: the new iteration mechanism only works if
pandoc has been compiled with Lua 5.2 or later (current default: 5.3). -
Text.Pandoc.Parsing.uri: allow
&
and=
as word characters (#4068).
This fixes a bug where pandoc would stop parsing a URI with an
empty attribute: for example,&a=&b=
wolud stop ata
.
(The uri parser tries to guess which punctuation characters
are part of the URI and which might be punctuation after it.) -
Introduce
HasSyntaxExtensions
typeclass (Alexander Krotov, #4074).- Added new
HasSyntaxExtensions
typeclass forReaderOptions
and
WriterOptions
. - Reimplemented
isEnabled
function fromOptions.hs
to accept both
ReaderOptions
andWriterOptions
. - Replaced
enabled
fromCommonMark.hs
with newisEnabled
.
- Added new
-
Add
amuse
extension (Alexander Krotov) to enable Amuse wiki
behavior formuse
. NewExt_amuse
constructor for
Extension
. Note: this is switched on by default; for
Emacs behavior, usemuse-amuse
. -
Muse reader (Alexander Krotov):
- Count only one space as part of list item marker.
- Produce SoftBreaks on newlines. Now wrapping can be preserved
with--wrap=preserve
. - Add Text::Amuse footnote extensions. Footnote end is indicated by
indentation, so footnotes can be placed anywhere in the text,
not just at the end of it. - Accept Emacs Muse definition lists when
-amuse
.
Emacs Muse does not require indentation.
-
HTML reader:
- Ensure we don’t produce level 0 headers (#4076), even for chapter
sections in epubs. This causes problems because writers aren’t set
up to expect these. - Allow spaces after
\(
and before\)
withtex_math_single_backslash
.
Previously\( \frac{1}{a} < \frac{1}{b} \)
was not parsed as math in
markdown
orhtml
+tex_math_single_backslash
.
- Ensure we don’t produce level 0 headers (#4076), even for chapter
-
MANUAL: clarify that math extensions work with HTML.
Clarify thattex_math_dollars
andtex_math_single_backslash
will work with HTML as well as Markdown. -
Creole reader: Fix performance issue for longer lists (Sascha Wilde,
#4067). -
RST reader: better support for ‘container’ directive (#4066).
Create a div, incorporate name attribute and classes. -
LaTeX reader:
-
EPUB writer: Fixed path for cover image (#4069). It was previously
media/media/imagename
, and should have beenmedia/imagename
. -
Markdown writer: fix bug with doubled footnotes in grid tables
(#4061). -
LaTeX template: include natbib/biblatex after polyglossia (#4073).
Otherwise we seem to get an error; biblatex wants polyglossia
language to be defined. -
Added examples to lua filters documentation.
pandoc 2.0.2
-
Deprecated ancient HTML math methods:
--latexmathml
,--gladtex
,
--mimetex
,--jsmath
. -
Fixed URIs in
data/jats.csl
. They were being rendered twice,
leading to invalid XML in default JATS output with pandoc-citeproc. -
lua-filters.md
: use real-world man page filter as example. -
Add lua filter functions
walk_inline
andwalk_block
in the pandoc module, to apply filters inside particular
inline and block elements. -
Refactored some code from
Text.Pandoc.Lua.PandocModule
into new internal moduleText.Pandoc.Lua.Filter
. -
Markdown reader:
- Allow fenced code blocks to be indented 1-3 spaces (#4011).
This brings our handling of them into alignment with CommonMark’s. - Fix YAML metadata with “chomp” (
|-
). Previously if a
YAML block under|-
contained a blank line, pandoc would
not parse it as metadata.
- Allow fenced code blocks to be indented 1-3 spaces (#4011).
-
Removed
etc.
from abbreviations file. Oftenetc.
ends a
sentence, and we want the period to be treated as a
sentence-ending period. -
Fix regression with
--metadata
(#4054). Values specified with
--metadata
should replace a metadata value set in the document
itself, rather than creating a list including a new value. -
EPUB writer:
-
Miscellaneous code cleanup (Alexander Krotov).
-
Use pandoc-types 1.17.3, which adds
Walkable
instances
for[Block] Block
and[Inline] Inline
. -
Remove obsolete
stack.full.yaml
(#4052). -
Change to using pandoc-citeproc 0.12.1 in binary packages.
-
Consolidate math output method documentation (#4049, Mauro Bieg).
-
MANUAL.txt
: fix header level of “Extension: emoji” (Albert Krewinkel). -
Use lua filter to generate man page from
MANUAL.txt
, replacing old
Haskell filters. This is easier and faster. -
Improved
INSTALL.md
. -
Update commands to extract deb archive on Linux (#4043, Salim B).
pandoc 2.0.1.1
-
Improved fix to #3989 (parsing of HTML tags containing
>
in an attribute or comment). The previous fix (in 2.0.1) only
worked in certain cases. -
FB2 writer (Alexander Krotov):
- Add
unrecognised
genre to<title-info>
(Alexander Krotov). XML schema requires at least one genre. - Remove
<annotation>
from<body>
.
- Add
-
CommonMark writer: fix strikethrough for
gfm
(#4038). -
Use texmath 0.10, which adds support for a wider range of
symbols and fixes default column alignments in MathML
and OMML. -
Highlighting fixes, using skylighting 0.4.3.2:
- Fix invalid CSS.
- Support
lineAnchors
(orline-anchors
) in HTML code blocks. - Ensure that code lines don’t get duplicate identifiers (#4031).
The line identifiers are built using the code block’s identifier
as a prefix. If the code block has null identifier, we use
cb1
,cb2
, etc.
-
Added a few abbreviations to
data/abbreviations
,
and sorted the list (#3984, Wandmalfarbe). -
Improved support for columns in HTML writer (#4028).
- Remove
width
attribute from thediv
. - Remove space between
<div class="column">
elements,
since this prevents columns whose widths sum to 100%
(the space takes up space). - Move as much as possible of the CSS to the template.
- Ensure that all the HTML-based templates (including epub)
contain the CSS for columns. - Columns default to 50% width unless they are given a width
attribute. So if you want two equal-width columns, you
can use a div with classcolumn
and nowidth
attribute.
- Remove
-
SelfContained: use
base64
for css links with media attribute (#4026).
This fixes--self-contained
with S5. -
Improve
pandoc-template-mode.el
(Vaclav Haisman). -
Issue INFO, not WARNING, when a .sty file cannot be
read in LaTeX reader. It is normally not an issue requiring
a fix from the user if .sty files are not found. -
INSTALL.md: MacOS instructions needed xar -f (adam234).
-
MANUAL.txt:
pandoc 2.0.1
-
Fixed regression in parsing of HTML comments in markdown and other
non-HTML formats (Text.Pandoc.Readers.HTML.htmlTag
) (#4019).
The parser stopped at the first>
character, even if it wasn't
the end of the comment. -
Creole reader (Sascha Wilde):
- Fix some minor typos and formatting.
- Add additional test on nowiki-block after para.
- Fix lists with trailing white space.
-
LaTeX reader: handle
%
comment right after command.
For example,\emph%
. -
Markdown reader: make sure fenced div closers work in lists.
Previously the following failed:::: {.class} 1. one 2. two :::
and you needed a blank line before the closing
:::
. -
Make
fenced_divs
affect the Markdown writer. Iffenced_divs
is
enabled, Divs will be rendered as fenced divs. -
LaTeX/Beamer writer: support "blocks" inside columns and other Divs
(#4016). -
HTML Writer: consistently use dashed class-names (Mauro Bieg, #3556).
Note: this change may require some changes in CSS rules.
footnoteRef
has becomefootnote-ref
,titleslide
has
becometitle-slide
, andfootnoteBack
has becomefootnote-back
. -
JATS writer: Properly pass through author metadata (#4020).
-
FB2 writer (Alexander Krotov):
- Write blocks outside of
<p>
in definitions. - Make bullet lists consistent with ordered lists, repeating
the marker for the outer list rather than indenting sublists,
since indentation does not work in readers. - Add new style FB2 tests.
- Write blocks outside of
-
Text.Pandoc.ImageSize
: AddMillimeter
constructor toDimension
(#4012) [API change]. Now sizes given in 'mm' are no longer converted
to 'cm'. -
Revise documentation of small caps syntax (Andrew Dunning, #4013).
-
Fix broken reference links in manual (Andrew Dunning, #4014)
-
Fixed example of slide columns structure in changelog (#4015).
Also documented this feature in MANUAL.txt.
pandoc 2.0.0.1
-
EPUB writer:
- Fixed filepaths for nonstandard epub-subdirectory values.
- Ensure that epub2 is recognized as a non-text format, so that a template is used.
- Don't include "prefix" attribute for ibooks for epub2. It doesn't validate.
- Fix stylesheet paths; previously we had an incorrect stylesheet path for the cover page and nav page.
-
LaTeX reader:
-
Linux package build: ensure that pandoc-citeproc is statically linked.
-
trypandoc: add native, ms.