Détail du package

@unified-latex/unified-latex-util-catcode

siefkenj33.7kMIT1.8.3

Tools for manipulating unified-latex ASTs

pegjs, latex, parser, prettier

readme

unified-latex-util-catcode

What is this?

Functions to identify regions of a unified-latex Abstract Syntax Tree (AST) that need to be reparsed because of different category codes. For example, regions between \makeatletter and \makeatother.

When should I use this?

If you need to identify regions of the AST that need to be reparsed.

Install

npm install @unified-latex/unified-latex-util-catcode

This package contains both esm and commonjs exports. To explicitly access the esm export, import the .js file. To explicitly access the commonjs export, import the .cjs file.

Functions

findExpl3AndAtLetterRegionsInArray(tree)

Find regions between \ExplSyntaxOn...\ExplSyntaxOff and \makeatletter...\makeatother. Returns an object containing regions where one or both syntax's apply.

function findExpl3AndAtLetterRegionsInArray(tree: Ast.Node[]): {
  explOnly: Region[];
  atLetterOnly: Region[];
  both: Region[];
};

Parameters

Param Type
tree Ast.Node[]

findRegionInArray(tree, start, end)

Find all contiguous segments in the array that are between start and end blocks. The start and end are functions that determine when a region starts and ends.

function findRegionInArray(
  tree: Ast.Node[],
  start: (node: Ast.Node) => boolean,
  end: (node: Ast.Node) => boolean
): Region[];

Parameters

Param Type
tree Ast.Node[]
start (node: Ast.Node) => boolean
end (node: Ast.Node) => boolean

hasReparsableMacroNames(tree, allowedTokens)

Checks whether tree has a macro that could be reparsed given the allowedTokens but do not do any reparsing. This function can be used in auto-detection schemes to determine if macro names should actually be reparsed.

function hasReparsableMacroNames(
  tree: Ast.Ast,
  allowedTokens: string | Set<string>
): boolean;

Parameters

Param Type
tree Ast.Ast
allowedTokens `string \ Set<string>`

hasReparsableMacroNamesInArray(tree, allowedTokens)

Checks whether the array has a macro that could be reparsed given the allowedTokens but do not do any reparsing. This function can be used in auto-detection schemes to determine if macro names should actually be reparsed.

function hasReparsableMacroNamesInArray(
  tree: Ast.Node[],
  allowedTokens: Set<string>
): boolean;

Parameters

Param Type
tree Ast.Node[]
allowedTokens Set<string>

reparseExpl3AndAtLetterRegions(tree)

Find regions between \ExplSyntaxOn...\ExplSyntaxOff and \makeatletter...\makeatother and reparse their contents so that the relevant characters (e.g., @, _, and :) become part of the macro names.

function reparseExpl3AndAtLetterRegions(tree: Ast.Ast): void;

Parameters

Param Type
tree Ast.Ast

reparseMacroNames(tree, allowedTokens)

Reparses all macro names so that they may optionally include characters listed in allowedTokens. This is used, for example, when parsing expl3 syntax which allows _ to be used in a macro name (even though _ is normally stops the parsing for a macro name). Thus, a macro \foo_bar:Nn would be parsed as having the name foo_bar:Nn rather than as foo followed by the strings _, bar, :, Nn.

function reparseMacroNames(
  tree: Ast.Ast,
  allowedTokens: string | Set<string>
): void;

Parameters

Param Type
tree Ast.Ast
allowedTokens `string \ Set<string>`

reparseMacroNamesInArray(tree, allowedTokens)

Reparses all macro names in the array so that they may optionally include characters listed in allowedTokens. This is used, for example, when parsing expl3 syntax which allows _ to be used in a macro name (even though _ is normally stops the parsing for a macro name).

function reparseMacroNamesInArray(
  tree: Ast.Node[],
  allowedTokens: Set<string>
): void;

Parameters

Param Type
tree Ast.Node[]
allowedTokens Set<string>

changelog

unified-latex Changelog

v1.8.3

  • Support \ref in PreTeXt conversion
  • Better use of UnifiedJS to parse but not print LaTeX
  • Support for \verb, \textsuperscript, \textsubscript, \sout, and \" i in HTML conversion

v1.8.2

  • Upgraded dependencies

v1.8.1

  • Changed Peggy to implement a caching parser to prevent large slowdown on some files.

v1.8.0

  • Added initial PreTeXt conversion support
  • Upgraded deps
  • Added amsart macros
  • Consume the whitespace after special character macros when expanding ligatures. For example \o y produces øy instead of ø y
  • Fix signatures of \hyphenation

v1.7.1

  • Types fix for @unified-latex/unified-latex-types
  • Fixed AST when expanding \sysdelim macros for rendering \systeme{} macros with KaTeX

v1.7.0

  • Switch build system to vite. Should result in smaller bundles.
  • Save default arguments when parsing if the macro signature specifies them e.g. {signature: "O{foo}"}. The defaults are substituted in when expanding the macros with the optional arguments omitted.
  • Preserve position information when comments are modified. (Sometimes, during a parse, but never during a parseMinimal, comments are modified to remove leading whitespace. Previously, modified comments would have their position information deleted. Position information is now preserved.)

v1.6.1

  • Pass VisitInfo as an additional argument ot macroReplacers and environmentReplacers in unifiedLatexToHast.
  • Allow skipping of HTML validation in unifiedLatexToHast.
  • The minted environment parses its contents as a verbatim.

v1.6.0

  • Embellishment tokens are now supported in macro signatures. E.g., a xxx: {signature: "e{^_}"} will allow \xxx_{foo}^{bar} and \xxx^{foo}_{bar} to parse correctly.
  • Stop tokens can now be regular string characters. For example xxx: {signature: "ua"} will allow \xxx YYYaBBB to consume YYY leaving BBB unconsumed.
  • Break after \\ macro when pretty printing (Issue #59)
  • [DEVELOPMENT] Added tsconfig.json files to each test/ folder for more granular control of the typescript settings.

v1.5.0

  • HTML conversion: vspace and hspace now give the amount in a data-amount attribute.
  • HTML conversion: unknown macros now have their arguments wrapped in spans instead of appearing as formatted LaTeX code.
  • Add basic Markdown conversion support.

v1.4.2

  • Avoid slowdown when paring incomplete environments (e.g. \newcommand{\x}{\begin{x}}). This is accomplished by enabling caching in PEGjs.
  • Added " ligature and \paragraph and \subparagraph to HTML conversion.

v1.4.1

  • Many more ligatures added to the HTML converter.
  • Fixed issue #40 where the optional argument to \\ was being parsed even if preceded by a space. (E.g., \\[10pt] and \\ [10pt]) were parsed the same. Not allowing the space should more closely match expected behavior.
  • Bump Prettier to v2.8.8

v1.4.0

  • Better CJS support (now unified is compiled in rather than left as an external dependency. This is needed because unified is ESM-only).
  • minted and listings environments now accept optional arguments and parse their contents verbatim. This makes them much more efficient.

v1.3.0

  • Initial support for parsing and pretty-printing of tikz environments.
  • Added support for xparse u-type arguments.
  • Can now pass an argumentParser attribute for custom argument parsing (instead of relying on an xparse signature)