Documentation ¶
Overview ¶
Package docx provides a Normaliser implementation for Microsoft Word DOCX files. It extracts text content from the XML parts stored inside the document's ZIP archive.
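The extraction described above can be illustrated with the standard library alone. The following is a minimal sketch, not the package's actual implementation: the helper names `extractDocxText`, `textFromDocumentXML`, and `buildDocx` are hypothetical, and it assumes the body text lives in `word/document.xml` as WordprocessingML `w:t` runs inside `w:p` paragraphs, which is the usual DOCX layout.

```go
package main

import (
	"archive/zip"
	"bytes"
	"encoding/xml"
	"errors"
	"fmt"
	"io"
	"strings"
)

// wordNS is the WordprocessingML main namespace used by w:p and w:t.
const wordNS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

// extractDocxText (hypothetical helper) opens the DOCX bytes as a ZIP
// archive and returns the text found in word/document.xml.
func extractDocxText(data []byte) (string, error) {
	zr, err := zip.NewReader(bytes.NewReader(data), int64(len(data)))
	if err != nil {
		return "", fmt.Errorf("open zip: %w", err)
	}
	for _, f := range zr.File {
		if f.Name != "word/document.xml" {
			continue
		}
		rc, err := f.Open()
		if err != nil {
			return "", fmt.Errorf("open document part: %w", err)
		}
		defer rc.Close()
		return textFromDocumentXML(rc)
	}
	return "", errors.New("word/document.xml not found")
}

// textFromDocumentXML (hypothetical helper) streams the XML tokens,
// collecting character data inside w:t and ending a line at each </w:p>.
func textFromDocumentXML(r io.Reader) (string, error) {
	dec := xml.NewDecoder(r)
	var sb strings.Builder
	inText := false
	for {
		tok, err := dec.Token()
		if err == io.EOF {
			break
		}
		if err != nil {
			return "", fmt.Errorf("parse xml: %w", err)
		}
		switch t := tok.(type) {
		case xml.StartElement:
			if t.Name.Space == wordNS && t.Name.Local == "t" {
				inText = true
			}
		case xml.EndElement:
			if t.Name.Space != wordNS {
				continue
			}
			switch t.Name.Local {
			case "t":
				inText = false
			case "p":
				sb.WriteByte('\n')
			}
		case xml.CharData:
			if inText {
				sb.Write(t)
			}
		}
	}
	return strings.TrimSpace(sb.String()), nil
}

// buildDocx (demo only) wraps documentXML in a one-file ZIP archive.
func buildDocx(documentXML string) ([]byte, error) {
	var buf bytes.Buffer
	zw := zip.NewWriter(&buf)
	w, err := zw.Create("word/document.xml")
	if err != nil {
		return nil, err
	}
	if _, err := w.Write([]byte(documentXML)); err != nil {
		return nil, err
	}
	if err := zw.Close(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	data, err := buildDocx(`<w:document xmlns:w="` + wordNS + `"><w:body>` +
		`<w:p><w:r><w:t>Hello</w:t></w:r></w:p>` +
		`<w:p><w:r><w:t>World</w:t></w:r></w:p>` +
		`</w:body></w:document>`)
	if err != nil {
		panic(err)
	}
	text, err := extractDocxText(data)
	if err != nil {
		panic(err)
	}
	fmt.Println(text)
}
```

Streaming tokens with `xml.Decoder` avoids declaring struct types for the whole document schema. A real normaliser would likely also handle tabs, line breaks, tables, headers, and footers, which this sketch ignores.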
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Normaliser ¶
type Normaliser struct{}
Normaliser handles DOCX documents.
func (*Normaliser) Normalise ¶
func (n *Normaliser) Normalise(_ context.Context, raw *domain.RawDocument) (*driven.NormaliseResult, error)
Normalise converts a DOCX document to a normalised document.
func (*Normaliser) Priority ¶
func (n *Normaliser) Priority() int
Priority returns the selection priority.
func (*Normaliser) SupportedConnectorTypes ¶
func (n *Normaliser) SupportedConnectorTypes() []string
SupportedConnectorTypes returns connector types for specialised handling.
func (*Normaliser) SupportedMIMETypes ¶
func (n *Normaliser) SupportedMIMETypes() []string
SupportedMIMETypes returns the MIME types this normaliser handles.
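To show how SupportedMIMETypes and Priority might work together, here is a sketch of a caller choosing between several normalisers. Everything in it other than the two method signatures is an assumption: the `normaliser` interface, the `stub` type, the `pick` function, the rule that the highest priority wins, and the priority values are all hypothetical. The MIME type string is the registered one for DOCX files, but this page does not confirm which values the package actually returns.

```go
package main

import "fmt"

// docxMIME is the registered MIME type for DOCX files. Whether this
// package returns exactly this value is an assumption.
const docxMIME = "application/vnd.openxmlformats-officedocument.wordprocessingml.document"

// normaliser is a hypothetical interface covering only the two
// selection-related methods listed in this documentation.
type normaliser interface {
	SupportedMIMETypes() []string
	Priority() int
}

// stub is a stand-in implementation used only for this illustration.
type stub struct {
	name     string
	mimes    []string
	priority int
}

func (s stub) SupportedMIMETypes() []string { return s.mimes }
func (s stub) Priority() int                { return s.priority }

// pick returns the normaliser that supports mime and has the highest
// priority, or nil if none matches. "Highest wins" is an assumed rule.
func pick(all []normaliser, mime string) normaliser {
	var best normaliser
	for _, n := range all {
		for _, m := range n.SupportedMIMETypes() {
			if m != mime {
				continue
			}
			if best == nil || n.Priority() > best.Priority() {
				best = n
			}
			break
		}
	}
	return best
}

func main() {
	all := []normaliser{
		stub{name: "fallback", mimes: []string{docxMIME, "text/plain"}, priority: 1},
		stub{name: "docx", mimes: []string{docxMIME}, priority: 50},
	}
	if n := pick(all, docxMIME); n != nil {
		fmt.Println(n.(stub).name)
	}
}
```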