html

package
v0.1.9 Latest
Warning

This package is not in the latest version of its module.

Published: Oct 21, 2021 | License: Apache-2.0 | Imports: 4 | Imported by: 0

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

func ParseHTML

func ParseHTML(body []byte) (*goquery.Document, error)

ParseHTML parses the HTML in body and returns the resulting goquery document.

Types

type HTMLElement

type HTMLElement struct {
	// Name is the name of the tag
	Name string

	// DOM is the goquery parsed DOM object of the page. DOM is relative
	// to the current HTMLElement
	DOM *goquery.Selection

	// Index stores the position of the current element within
	// all the elements matched by an OnHTML callback
	Index int

	Node *html.Node
}

HTMLElement is the representation of an HTML tag.

func NewHTMLElementFromSelectionNode

func NewHTMLElementFromSelectionNode(s *goquery.Selection, n *html.Node, index int) *HTMLElement

NewHTMLElementFromSelectionNode creates an HTMLElement from a goquery.Selection and one of its nodes.

func (HTMLElement) Attr

func (he HTMLElement) Attr(key string) string

Attr returns the value of the named attribute of the HTMLElement, or an empty string if the attribute is not found.

func (HTMLElement) Child

func (he HTMLElement) Child(selector string, num int) *HTMLElement

Child returns the num-th matched child element. Counting starts at 1, not 0.
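The 1-based counting is easy to trip over next to Go's 0-based slices. The sketch below illustrates only the index arithmetic, with a plain string slice standing in for the matched elements. nthMatch is a hypothetical helper, and its out-of-range behavior is an assumption, since the documentation does not say what Child returns in that case.

```go
package main

import "fmt"

// nthMatch is a hypothetical helper: it returns the num-th entry of matches
// using 1-based counting, and false when num is out of range. The
// out-of-range handling is an assumption for illustration only.
func nthMatch(matches []string, num int) (string, bool) {
	if num < 1 || num > len(matches) {
		return "", false
	}
	// Convert the 1-based position to a 0-based slice index.
	return matches[num-1], true
}

func main() {
	matches := []string{"first", "second", "third"}

	if v, ok := nthMatch(matches, 1); ok {
		fmt.Println(v) // prints "first"
	}

	// 0 is not a valid position under 1-based counting.
	_, ok := nthMatch(matches, 0)
	fmt.Println(ok) // prints "false"
}
```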

func (HTMLElement) ChildAttr

func (he HTMLElement) ChildAttr(selector, attrName string) string

ChildAttr returns the whitespace-trimmed value of the named attribute on the first matching element.

func (HTMLElement) ChildText

func (he HTMLElement) ChildText(selector string) string

ChildText returns the concatenated and stripped text content of the matching elements.

func (HTMLElement) ChildrenAttr

func (he HTMLElement) ChildrenAttr(selector, attrName string) []string

ChildrenAttr returns the whitespace-trimmed values of the named attribute on all matching elements.

func (HTMLElement) ChildrenText

func (he HTMLElement) ChildrenText(selector string) []string

ChildrenText returns the stripped text content of all the matching elements.

func (HTMLElement) Each

func (he HTMLElement) Each(selector string, callback func(int, *HTMLElement))

Each iterates over the elements matched by the first argument and calls the callback function on every HTMLElement match.

func (HTMLElement) FirstChild

func (he HTMLElement) FirstChild(selector string) *HTMLElement

FirstChild returns the first child element that matches the selector.

func (HTMLElement) InnerHTML

func (he HTMLElement) InnerHTML() (string, error)

InnerHTML gets the HTML contents of the first element in the set of matched elements. It includes text and comment nodes.

func (HTMLElement) LastChild

func (he HTMLElement) LastChild(selector string) *HTMLElement

LastChild returns the last child element that matches the selector.

func (HTMLElement) OuterHTML

func (he HTMLElement) OuterHTML() (string, error)

OuterHTML returns the outer HTML rendering of the first item in the selection, that is, the HTML including the first element's tag and attributes.

func (HTMLElement) Parent

func (he HTMLElement) Parent() *HTMLElement

Parent returns the direct parent element.

func (HTMLElement) Parents

func (he HTMLElement) Parents() []*HTMLElement

Parents returns all parent elements.

func (HTMLElement) Text

func (he HTMLElement) Text() string

Text gets the combined text contents of each element in the set of matched elements, including their descendants.
