Documentation
Index
- type Parser
- func (p *Parser) GetSupportedTypes() []string
- func (p *Parser) ParseStream(ctx context.Context, r io.Reader, metadata map[string]any) (<-chan *entity.Document, error)
- func (p *Parser) SetChunkOverlap(overlap int)
- func (p *Parser) SetChunkSize(size int)
- func (p *Parser) SetPreserveComments(preserve bool)
Constants
This section is empty.
Variables
This section is empty.
Functions
This section is empty.
Types
type Parser
type Parser struct {
// contains filtered or unexported fields
}
Parser implements an XML parser that uses SAX-style (event-driven) parsing.
func (*Parser) GetSupportedTypes
func (p *Parser) GetSupportedTypes() []string
GetSupportedTypes returns the formats this parser supports.
func (*Parser) ParseStream
func (p *Parser) ParseStream(ctx context.Context, r io.Reader, metadata map[string]any) (<-chan *entity.Document, error)
ParseStream implements the dataprep.Parser interface. It reads XML from r and returns a receive-only channel of *entity.Document values, or an error.
func (*Parser) SetChunkOverlap
func (p *Parser) SetChunkOverlap(overlap int)
SetChunkOverlap sets the chunk overlap to overlap.
func (*Parser) SetChunkSize
func (p *Parser) SetChunkSize(size int)
SetChunkSize sets the chunk size to size.
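How chunk size and chunk overlap typically interact can be sketched as a sliding window. The example assumes both values are measured in runes and that an invalid overlap falls back to zero; neither detail is confirmed by this documentation, and chunkRunes is a hypothetical helper rather than part of the package.

```go
package main

import "fmt"

// chunkRunes is a hypothetical helper (not part of this package).
// It splits text into windows of at most size runes, where each window
// after the first repeats the last overlap runes of the previous one.
func chunkRunes(text string, size, overlap int) []string {
	if size <= 0 {
		return nil
	}
	if overlap < 0 || overlap >= size {
		overlap = 0 // assumed policy: ignore an overlap that would stall the window
	}
	runes := []rune(text)
	step := size - overlap // how far the window advances each time
	var chunks []string
	for start := 0; start < len(runes); start += step {
		end := start + size
		if end > len(runes) {
			end = len(runes)
		}
		chunks = append(chunks, string(runes[start:end]))
		if end == len(runes) {
			break // the final window reached the end of the text
		}
	}
	return chunks
}

func main() {
	for _, c := range chunkRunes("abcdefghij", 4, 1) {
		fmt.Println(c)
	}
}
```

With a size of 4 and an overlap of 1, the window advances 3 runes at a time, so adjacent chunks share one rune of context.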
func (*Parser) SetPreserveComments
func (p *Parser) SetPreserveComments(preserve bool)
SetPreserveComments sets whether the parser preserves XML comments.