包详细信息

@unified-latex/unified-latex-util-parse

siefkenj34.1kMIT1.8.3

Tools for manipulating unified-latex ASTs

pegjs, latex, parser, prettier

自述文件

unified-latex-util-parse

What is this?

Functions parse strings to a unified-latex Abstract Syntax Tree (AST).

When should I use this?

If you have a string that you would like to parse to a unified-latex Ast.Ast, or if you are building a plugin for unified() that manipulates LaTeX.

Install

npm install @unified-latex/unified-latex-util-parse

This package contains both esm and commonjs exports. To explicitly access the esm export, import the .js file. To explicitly access the commonjs export, import the .cjs file.

Plugins

unifiedLatexAstComplier

Unified complier plugin that passes through a LaTeX AST without modification.

Usage

unified().use(unifiedLatexAstComplier)

Type

Plugin<void[], Ast.Root, Ast.Root>

function unifiedLatexAstComplier(): void;

unifiedLatexFromString

Parse a string to a LaTeX AST.

Usage

unified().use(unifiedLatexFromString[, options])

options

{ mode?: "math" | "regular"; macros?: MacroInfoRecord; environments?: EnvInfoRecord; flags?: { atLetter?: boolean; expl3?: boolean; autodetectExpl3AndAtLetter?: boolean; }; }

Type

Plugin<{ mode?: "math" | "regular"; macros?: MacroInfoRecord; environments?: EnvInfoRecord; flags?: { atLetter?: boolean; expl3?: boolean; autodetectExpl3AndAtLetter?: boolean; }; }[], string, Ast.Root>

function unifiedLatexFromString(options: {
  mode?: "math" | "regular";
  macros?: MacroInfoRecord;
  environments?: EnvInfoRecord;
  flags?: {
    atLetter?: boolean;
    expl3?: boolean;
    autodetectExpl3AndAtLetter?: boolean;
  };
}): void;

unifiedLatexFromStringMinimal

Parse a string to a LaTeX AST with no post processing. For example, no macro arguments will be attached, etc.

Usage

unified().use(unifiedLatexFromStringMinimal[, options])

options

PluginOptions

Type

Plugin<PluginOptions[], string, Ast.Root>

function unifiedLatexFromStringMinimal(options: PluginOptions): void;

unifiedLatexProcessAtLetterAndExplMacros

Unified plugin to reprocess macros names to possibly include @, _, or :. This plugin detects the \makeatletter and \ExplSyntaxOn commands and reprocesses macro names inside of those blocks to include those characters.

Usage

unified().use(unifiedLatexProcessAtLetterAndExplMacros[, options])

options

{ atLetter?: boolean; expl3?: boolean; autodetectExpl3AndAtLetter?: boolean; }

Type

Plugin<{ atLetter?: boolean; expl3?: boolean; autodetectExpl3AndAtLetter?: boolean; }[], Ast.Root, Ast.Root>

function unifiedLatexProcessAtLetterAndExplMacros(options: {
  atLetter?: boolean;
  expl3?: boolean;
  autodetectExpl3AndAtLetter?: boolean;
}): (tree: Ast.Root) => void;

unifiedLatexProcessMacrosAndEnvironmentsWithMathReparse

Unified plugin to process macros and environments. Any environments that contain math content are reparsed (if needed) in math mode.

Usage

unified().use(unifiedLatexProcessMacrosAndEnvironmentsWithMathReparse[, options])

options

{ environments: EnvInfoRecord; macros: MacroInfoRecord; }

Type

Plugin<{ environments: EnvInfoRecord; macros: MacroInfoRecord; }[], Ast.Root, Ast.Root>

function unifiedLatexProcessMacrosAndEnvironmentsWithMathReparse(options: {
  environments: EnvInfoRecord;
  macros: MacroInfoRecord;
}): (tree: Ast.Root) => void;

unifiedLatexReparseMath

Reparse math environments/macro contents that should have been parsed in math mode but weren't.

Usage

unified().use(unifiedLatexReparseMath[, options])

options

{ mathEnvs: string[]; mathMacros: string[]; }

Type

Plugin<{ mathEnvs: string[]; mathMacros: string[]; }[], Ast.Root, Ast.Root>

function unifiedLatexReparseMath(options: {
  mathEnvs: string[];
  mathMacros: string[];
}): (tree: Ast.Root) => void;

Functions

getParser(options)

Returns the default unified-latex parser, or create a new one with the provided unifiedLatexFromString options

function getParser(options: {
  mode?: "math" | "regular";
  macros?: Ast.MacroInfoRecord;
  environments?: Ast.EnvInfoRecord;
  flags?: {
    atLetter?: boolean;
    expl3?: boolean;
    autodetectExpl3AndAtLetter?: boolean;
  };
}): FrozenProcessor<Ast.Root, Ast.Root, Ast.Root, void>;

Parameters

Param Type Description
options Omitted Plugin options of unifiedLatexFromString plugin.

parse(str)

Parse the string into an AST.

function parse(str: String): Ast.Root;

Parameters

Param Type
str String

parseMath(str)

Parse str into an AST. Parsing starts in math mode and a list of nodes is returned (instead of a "root" node).

function parseMath(str: string | Ast.Ast): Ast.Node[];

Parameters

Param Type
str `string \ Ast.Ast`

parseMathMinimal(str)

Parse str to an AST with minimal processing. E.g., macro arguments are not attached to macros, etc. when parsed with this function.

The parsing assumes a math-mode context, so, for example, ^ and _ are parsed as macros (even though arguments are not attached to them).

function parseMathMinimal(str: String): Ast.Node[];

Parameters

Param Type
str String

parseMinimal(str)

Parse str to an AST with minimal processing. E.g., macro arguments are not attached to macros, etc. when parsed with this function.

function parseMinimal(str: String): Ast.Root;

Parameters

Param Type
str String

`unifiedLatexReparseMathConstructPlugin({ mathEnvs, mathMacros,

})`

Construct the inner function for the unifiedLatexReparseMath plugin. This function should not be used by libraries.

function unifiedLatexReparseMathConstructPlugin({
  mathEnvs,
  mathMacros,
}: {
  mathEnvs: string[];
  mathMacros: string[];
}): (tree: Ast.Root) => void;

Parameters

Param Type
{ mathEnvs, mathMacros, } Omitted

Types

PluginOptions

export type PluginOptions =
    | {
          mode?: "math" | "regular";
          macros?: MacroInfoRecord;
          environments?: EnvInfoRecord;
          flags?: {
              /**
               * Whether to parse macros as if `\makeatletter` is set (i.e., parse `@` as a regular macro character)
               */
              atLetter?: boolean;
              /**
               * Whether to parse macros as if `\ExplSyntaxOn` is set (i.e., parse `_` and `:` as a regular macro character)
               */
              expl3?: boolean;
              /**
               * Attempt to autodetect whether there are macros that look like they should contain `@`, `_`, or `:`.
               * Defaults to `false`.
               */
              autodetectExpl3AndAtLetter?: boolean;
          };
      }
    | undefined;

更新日志

unified-latex Changelog

v1.8.3

  • Support \ref in PreTeXt conversion
  • Better use of UnifiedJS to parse but not print LaTeX
  • Support for \verb, \textsuperscript, \textsubscript, \sout, and \" i in HTML conversion

v1.8.2

  • Upgraded dependencies

v1.8.1

  • Changed Peggy to implement a caching parser to prevent large slowdown on some files.

v1.8.0

  • Added initial PreTeXt conversion support
  • Upgraded deps
  • Added amsart macros
  • Consume the whitespace after special character macros when expanding ligatures. For example \o y produces øy instead of ø y
  • Fix signatures of \hyphenation

v1.7.1

  • Types fix for @unified-latex/unified-latex-types
  • Fixed AST when expanding \sysdelim macros for rendering \systeme{} macros with KaTeX

v1.7.0

  • Switch build system to vite. Should result in smaller bundles.
  • Save default arguments when parsing if the macro signature specifies them e.g. {signature: "O{foo}"}. The defaults are substituted in when expanding the macros with the optional arguments omitted.
  • Preserve position information when comments are modified. (Sometimes, during a parse, but never during a parseMinimal, comments are modified to remove leading whitespace. Previously, modified comments would have their position information deleted. Position information is now preserved.)

v1.6.1

  • Pass VisitInfo as an additional argument ot macroReplacers and environmentReplacers in unifiedLatexToHast.
  • Allow skipping of HTML validation in unifiedLatexToHast.
  • The minted environment parses its contents as a verbatim.

v1.6.0

  • Embellishment tokens are now supported in macro signatures. E.g., a xxx: {signature: "e{^_}"} will allow \xxx_{foo}^{bar} and \xxx^{foo}_{bar} to parse correctly.
  • Stop tokens can now be regular string characters. For example xxx: {signature: "ua"} will allow \xxx YYYaBBB to consume YYY leaving BBB unconsumed.
  • Break after \\ macro when pretty printing (Issue #59)
  • [DEVELOPMENT] Added tsconfig.json files to each test/ folder for more granular control of the typescript settings.

v1.5.0

  • HTML conversion: vspace and hspace now give the amount in a data-amount attribute.
  • HTML conversion: unknown macros now have their arguments wrapped in spans instead of appearing as formatted LaTeX code.
  • Add basic Markdown conversion support.

v1.4.2

  • Avoid slowdown when paring incomplete environments (e.g. \newcommand{\x}{\begin{x}}). This is accomplished by enabling caching in PEGjs.
  • Added " ligature and \paragraph and \subparagraph to HTML conversion.

v1.4.1

  • Many more ligatures added to the HTML converter.
  • Fixed issue #40 where the optional argument to \\ was being parsed even if preceded by a space. (E.g., \\[10pt] and \\ [10pt]) were parsed the same. Not allowing the space should more closely match expected behavior.
  • Bump Prettier to v2.8.8

v1.4.0

  • Better CJS support (now unified is compiled in rather than left as an external dependency. This is needed because unified is ESM-only).
  • minted and listings environments now accept optional arguments and parse their contents verbatim. This makes them much more efficient.

v1.3.0

  • Initial support for parsing and pretty-printing of tikz environments.
  • Added support for xparse u-type arguments.
  • Can now pass an argumentParser attribute for custom argument parsing (instead of relying on an xparse signature)