subscript

package module

v0.1.1 Latest Latest Go to latest Published: Sep 7, 2025 License: MIT Imports: 8 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/zmtcreative/gm-subscript

Links

Open Source Insights

README ¶

Goldmark Subscript Extension

GitHub Tag

A Goldmark extension that adds subscript support using single-tilde syntax (H~2~O). This extension allows you to render subscripts in your Markdown documents while maintaining full compatibility with Goldmark's built-in strikethrough extension.

Installation

go get github.com/zmtcreative/gm-subscript

Configuration

Basic Usage

package main

import (
    "bytes"
    "fmt"

    "github.com/yuin/goldmark"
    "github.com/zmtcreative/gm-subscript"
)

func main() {
    md := goldmark.New(
        goldmark.WithExtensions(
            subscript.Subscript, // Use the pre-configured instance
        ),
    )

    var buf bytes.Buffer
    if err := md.Convert([]byte("H~2~O"), &buf); err != nil {
        panic(err)
    }
    fmt.Print(buf.String()) // Output: <p>H<sub>2</sub>O</p>
}

Alternative Configuration

md := goldmark.New(
    goldmark.WithExtensions(
        subscript.NewSubscript(), // Create a new instance
    ),
)

With Other Extensions

This extension works with other Goldmark extensions, including the built-in strikethrough:

import (
    "github.com/yuin/goldmark"
    "github.com/yuin/goldmark/extension"
    "github.com/zmtcreative/gm-subscript"
)

md := goldmark.New(
    goldmark.WithExtensions(
        extension.GFM,              // Includes strikethrough
        extension.DefinitionList,
        extension.Footnote,
        subscript.Subscript,        // Add subscript support
    ),
)

Syntax Rules

[!TIP]

Use double-tilde for strikethrough everywhere — To reduce ambiguity, try to always use double-tilde delimiters for strikethrough. This reduces the ambiguity when subscript is enabled. It's not a perfect solution — the conflicting syntax is frustrating, but since no one entity is truly setting a definitive standard for markdown, this is what we have.

No whitespace inside subscripts: Content between tildes cannot contain spaces (strikethrough cannot have leading or trailing spaces between tildes either, but it can have spaces between words)
- ✅ H~2~O → H₂O
- ❌ H~2 ab~O → H~~2 ab~~O (not parsed as subscript but will be parsed as strikethrough)
- ❌ H~2 ~O → H~2 ~O (not parsed as subscript OR strikethrough)
- ❌ H~ 2~O → H~ 2~O (not parsed as subscript OR strikethrough)
- ❌ H~ 2 ~O → H~ 2~O (not parsed as subscript OR strikethrough)
Must be preceded by non-whitespace: Subscripts cannot start at the beginning of a line or after whitespace
- ✅ H~2~O → H₂O
- ❌ ~2~O → 2O (parsed as strikethrough -- tilde at beginning of line)
- ❌ H ~2~O → H 2O (parsed as strikethrough -- space before opening tilde)
Single tildes only: Double tildes are reserved for strikethrough
- ✅ body~text~ → body_text subscript (when rules 1-2 are met)
- ✅ body~~text~~ → body~~text~~ (strikethrough)
No nested markdown: Other markdown syntax is not processed inside subscripts
- ✅ Text~word~ → Text_{word}
- For complex formatting, use HTML directly: Textword
- For really complex formatting and output use KaTex or Mathjax (or similar LaTeX rendering)
No nested tildes: Content cannot contain tilde characters (opening and closing tilde are consumed during parsing)
- ✅ H~2~O → H₂O
- ✅ H~2~~O~ → H₂_O — ~2~ and ~O~ are parsed as separate subscripts!
- ❌ H~2~O~ → H₂O~ — only the tildes around the 2 are parsed — the last tilde is just a plain text character
- ❌ ~H~2~O~ → ~~H₂O~~ — here is how it's parsed:
 - The first tilde is at the beginning of the line (which is not allowed), so the parser skips it, and
 - Moves on to the next valid subscript 2, which is rendered as a subscript (and the two tilde delimiters are consumed)
 - Leaving ~H2O~ for the strikethrough parser to process, so the entire H₂O is struck through

[!NOTE]

The subscript parser has higher priority than the strikethrough parser, so it can find and consume valid subscript markdown before the strikethrough parser makes its pass. The subscript parser is a single-pass parser, so there is no backtracking to look for nested tildes. Both parsers consume their tildes. This means that whatever is left over after the subscript parses the markdown is what the strikethrough parser sees on its pass. This can sometimes lead to unexpected rendering results, but is the best we can do given the shared syntax.

Examples

Basic Chemical Formulas

Input:  H~2~O
Output: H<sub>2</sub>O (water)

H₂O (water)

Input:  C~6~H~12~O~6~
Output: C<sub>6</sub>H<sub>12</sub>O<sub>6</sub> (glucose)

C₆H₁₂O₆ (glucose)

Mathematical Expressions

Input:  x~1~, x~(2)~, x~n+1~
Output: x<sub>1</sub>, x<sub>(2)</sub>, x<sub>n+1</sub>

x₁, x₍₂₎, x_n+1

Input:  log~10~(100) = 2
Output: log<sub>10</sub>(100) = 2

log₁₀(100) = 2

Combined with Strikethrough

Input:  H~2~O is ~~not~~ essential for life
Output:

H₂O is ~~not~~ essential for life

// subscripts cannot be preceded by whitespace -- they must be part of other text
Input:  NH~4~ with ~subscript~ and ~~strikethrough~~
Output: NH<sub>4</sub> with <del>subscript</del> and <del>strikethrough</del>

NH₄ with ~~subscript~~ and ~~strikethrough~~

Special Characters, HTML Entities and Unicode

Input:  N~&#x1f47d;~ = R~&#x1f7af;~ × _f_~p~ × n~e~ × _f_~l~ × _f_~i~ × _f_~c~ × L
Output: N<sub>👽</sub> = R<sub>🞯</sub> × <em>f</em><sub>p</sub> × n<sub>e</sub> × <em>f</em><sub>l</sub> × <em>f</em><sub>i</sub> × <em>f</em><sub>c</sub> × L

N_👽 = R_🞯 × f_p × n_e × f_l × f_i × f_c × L

Input:  Text~αβγ123~end
Output: Text<sub>αβγ123</sub>end

Text_αβγ123end

[!NOTE]

Unicode and HTML Entities should render properly on most modern browsers, but a user's font selection might result in some Unicode and HTML Entity output rendering as unknown characters (e.g., �).

Compatibility

This extension is designed to work alongside Goldmark's built-in extension.Strikethrough extension. This requires some strict parsing rules regarding whitespace and embedding other markdown or HTML inside subscripts.

The parsing rules try to ensure proper disambiguation:

Strikethrough Coexistence

The extension tries to intelligently handle the shared use of the ~ character:

Single tildes (following the rules above) → subscripts
Double tildes → strikethrough
Single tildes with spaces between words → strikethrough
Single tildes at line start or after whitespace → strikethrough
Leading and/or trailing spaces on content text between tildes → plain text

Limitations

Simple content only: Subscripts are best suited for simple text, numbers, and basic symbols
No complex formatting: For complex subscripts with multiple formatting options, use HTML  tags directly
Mathematics: For complex mathematical expressions and equations, consider using KaTeX or MathJax instead

Use Cases

This extension is ideal for:

Scientific and chemical formulas
Simple Mathematical notation
Reference numbering
Simple technical documentation

For more complex scenarios requiring nested formatting or advanced mathematical notation, consider using:

Direct HTML  tags for complex formatting
KaTeX or MathJax for advanced mathematics

License

This project is licensed under the MIT License. See the LICENSE.md file for details.

Documentation ¶

Overview ¶

Package subscript provides a Goldmark extension for rendering subscripts using single-tilde syntax.

This extension allows content between single tildes (~text~) to be rendered as HTML subscripts (text), while preserving compatibility with Goldmark's strikethrough extension which uses double tildes (~~text~~).

Usage:

md := goldmark.New(
	goldmark.WithExtensions(
		subscript.NewSubscript(),
	),
)

The extension follows these parsing rules:

Subscripts must not start at the beginning of a line or after whitespace
Content between tildes cannot contain spaces or additional tildes
Empty subscripts (~~ with no content) are not parsed as subscripts

Index ¶

Variables
func NewSubscript(opts ...SubscriptOption) *subscript
func NewSubscriptHTMLRenderer(opts ...html.Option) renderer.NodeRenderer
func NewSubscriptParser() parser.InlineParser
type Node
- func NewSubscriptNode() *Node
- func (n *Node) Dump(source []byte, level int)
- func (*Node) Kind() ast.NodeKind
type SubscriptHTMLRenderer
- func (r *SubscriptHTMLRenderer) RegisterFuncs(reg renderer.NodeRendererFuncRegisterer)
type SubscriptOption

Constants ¶

This section is empty.

Variables ¶

View Source

var KindSubscript = ast.NewNodeKind("Subscript")

KindSubscript is a NodeKind of the Subscript node.

View Source

var Subscript = NewSubscript()

Subscript is a pre-configured subscript extension instance.

View Source

var SubscriptAttributeFilter = html.GlobalAttributeFilter

SubscriptAttributeFilter defines attribute names which subscript elements can have. Uses the global HTML attribute filter for consistency with other HTML elements.

Functions ¶

func NewSubscript ¶

func NewSubscript(opts ...SubscriptOption) *subscript

NewSubscript creates a new subscript extension with the given options.

func NewSubscriptHTMLRenderer ¶

func NewSubscriptHTMLRenderer(opts ...html.Option) renderer.NodeRenderer

NewSubscriptHTMLRenderer returns a new SubscriptHTMLRenderer with the given options.

func NewSubscriptParser ¶

func NewSubscriptParser() parser.InlineParser

NewSubscriptParser returns a new InlineParser that parses subscript expressions.

Types ¶

type Node ¶

type Node struct {
	ast.BaseInline
}

Node represents a subscript node in the AST.

func NewSubscriptNode ¶

func NewSubscriptNode() *Node

NewSubscriptNode returns a new Subscript node.

func (*Node) Dump ¶

func (n *Node) Dump(source []byte, level int)

Dump implements ast.Node.Dump.

func (*Node) Kind ¶

func (*Node) Kind() ast.NodeKind

Kind implements ast.Node.Kind.

type SubscriptHTMLRenderer ¶

type SubscriptHTMLRenderer struct {
	html.Config
}

SubscriptHTMLRenderer renders Subscript nodes to HTML elements.

func (*SubscriptHTMLRenderer) RegisterFuncs ¶

func (r *SubscriptHTMLRenderer) RegisterFuncs(reg renderer.NodeRendererFuncRegisterer)

RegisterFuncs implements renderer.NodeRenderer.RegisterFuncs.

type SubscriptOption ¶

type SubscriptOption func(*subscript)

SubscriptOption configures the subscript extension.

Source Files ¶

View all Source files

subscript.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL