A loader for web pages from the Hacker News website. It extends the CheerioWebBaseLoader class.

Hierarchy

Constructors

Properties

caller: AsyncCaller
timeout: number
webPath: string
selector?: SelectorType
textDecoder?: TextDecoder

Methods

  • Asynchronously loads the web page. If webPath includes "item", it calls loadComments() to load the comments from the page. Otherwise, it calls loadResults() to load the results from the page.

    Returns Promise<Document[]>

    A Promise that resolves to an array of Document instances.

  • Loads the documents and splits them using a specified text splitter.

    Parameters

    • splitter: TextSplitter = ...

    Returns Promise<Document[]>

    A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.

  • A static method that dynamically imports the Cheerio library and returns an object containing its load function. If the import fails, it throws an error.

    Returns Promise<{
        load: ((content, options?, isDocument?) => CheerioAPI);
    }>

    A Promise that resolves to an object containing the load function from the Cheerio library.

  • A static method that fetches the web documents at the given URLs and loads each one using Cheerio.

    Parameters

    • urls: string[]

      An array of URLs to fetch and load.

    • caller: AsyncCaller
    • timeout: undefined | number
    • Optional textDecoder: TextDecoder
    • Optional options: CheerioOptions

    Returns Promise<CheerioAPI[]>

    A Promise that resolves to an array of CheerioAPI instances.
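
Two behaviors described in the Methods list above can be illustrated without any network access: the choice between loadComments() and loadResults() based on webPath, and the load-then-split step. The following is a minimal sketch, not the library's implementation. The names `Doc`, `chooseStrategy` and `splitDocs`, the fixed-size chunking rule, and the example URLs are all assumptions made for illustration; a real TextSplitter typically chooses chunk boundaries more carefully.

```typescript
// All names in this block are hypothetical stand-ins, not the library's API.

// Assumed minimal shape of a loaded document: text plus free-form metadata.
interface Doc {
  pageContent: string;
  metadata: Record<string, unknown>;
}

type Strategy = "loadComments" | "loadResults";

// Mirrors the documented rule: a webPath that includes "item" is handled
// as a comment page; anything else is handled as a results page.
function chooseStrategy(webPath: string): Strategy {
  return webPath.includes("item") ? "loadComments" : "loadResults";
}

// Illustrative splitter: cuts each document into fixed-size character
// chunks and copies the parent's metadata onto every chunk.
function splitDocs(docs: Doc[], chunkSize: number): Doc[] {
  if (chunkSize <= 0) {
    throw new RangeError("chunkSize must be positive");
  }
  const out: Doc[] = [];
  for (const doc of docs) {
    for (let i = 0; i < doc.pageContent.length; i += chunkSize) {
      out.push({
        pageContent: doc.pageContent.slice(i, i + chunkSize),
        metadata: { ...doc.metadata },
      });
    }
  }
  return out;
}

// Example URLs are illustrative only.
console.log(chooseStrategy("https://news.ycombinator.com/item?id=1")); // "loadComments"
console.log(chooseStrategy("https://news.ycombinator.com/newest")); // "loadResults"

// `loaded` stands in for the array that loading a page would resolve to.
const loaded: Doc[] = [
  { pageContent: "abcdefghij", metadata: { source: "example" } },
];
const chunks = splitDocs(loaded, 4);
console.log(chunks.map((c) => c.pageContent)); // chunks: "abcd", "efgh", "ij"
```

Because the documented check is a plain substring match, any webPath that contains "item" anywhere takes the comments branch in this sketch, including paths where "item" appears only as part of a longer word.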