A document loader that extends BufferLoader and loads documents from DOCX files.

Constructors

Properties

filePathOrBlob: string | Blob

Methods

  • Reads the buffer contents and metadata according to the type of filePathOrBlob (a file path string or a Blob), then calls parse() on the buffer and returns the resulting documents.

    Returns Promise<Document[]>

    Promise that resolves with an array of Document objects.
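
The branching on filePathOrBlob described above can be sketched as follows. This is a minimal illustration, not the library's implementation: the helper name `readSource` and the metadata keys (`source`, `blobType`) are assumptions chosen for the example, and a Node.js runtime with a global `Blob` is assumed.

```typescript
// Hypothetical helper: resolves a path or Blob into a buffer plus metadata.
type SourceMetadata = Record<string, unknown>;

async function readSource(
  filePathOrBlob: string | Blob
): Promise<{ buffer: Buffer; metadata: SourceMetadata }> {
  if (typeof filePathOrBlob === "string") {
    // File path: import readFile lazily so non-Node environments fail only here.
    const { readFile } = await import("node:fs/promises");
    const buffer = await readFile(filePathOrBlob);
    return { buffer, metadata: { source: filePathOrBlob } };
  }
  // Blob: copy its bytes into a Node.js Buffer.
  const buffer = Buffer.from(await filePathOrBlob.arrayBuffer());
  // The metadata keys below are illustrative assumptions.
  return { buffer, metadata: { source: "blob", blobType: filePathOrBlob.type } };
}
```

The resulting buffer and metadata would then be handed to parse().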

  • Loads the documents and splits them using a specified text splitter.

    Parameters

    • splitter: TextSplitter = ...

    Returns Promise<Document[]>

    A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
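
A rough sketch of the load-then-split flow, under stated assumptions: `SplitDoc`, `splitBySize`, and `loadAndSplitSketch` are hypothetical names, and the fixed-size splitter merely stands in for a real TextSplitter, whose default is not shown in this reference.

```typescript
// Minimal stand-in for a Document: text plus arbitrary metadata.
interface SplitDoc {
  pageContent: string;
  metadata: Record<string, unknown>;
}

// Hypothetical splitter: cuts each document into chunks of at most chunkSize characters.
function splitBySize(docs: SplitDoc[], chunkSize: number): SplitDoc[] {
  const chunks: SplitDoc[] = [];
  for (const doc of docs) {
    for (let start = 0; start < doc.pageContent.length; start += chunkSize) {
      chunks.push({
        pageContent: doc.pageContent.slice(start, start + chunkSize),
        // Each chunk carries a copy of the parent document's metadata.
        metadata: { ...doc.metadata },
      });
    }
  }
  return chunks;
}

// Load first, then split: the two steps described above.
async function loadAndSplitSketch(
  load: () => Promise<SplitDoc[]>,
  chunkSize: number
): Promise<SplitDoc[]> {
  const docs = await load();
  return splitBySize(docs, chunkSize);
}
```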

  • Takes a raw buffer and metadata and returns a promise that resolves to an array of Document instances. It uses the extractRawText function from the mammoth module to extract the raw text from the buffer. If the extracted text is empty, it returns an empty array. Otherwise, it returns a single-element array containing a new Document built from the extracted text and the provided metadata.

    Parameters

    • raw: Buffer

      The raw buffer from which to extract text content.

    • metadata: Document["metadata"]

      The metadata to be associated with the created Document instance.

    Returns Promise<Document[]>

    A promise that resolves to an array of Document instances.
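
The parse logic described above might look roughly like this. To keep the sketch self-contained, the text extractor is passed in as a parameter; in practice it would be mammoth's `extractRawText`, which accepts `{ buffer }` and resolves to an object with a `value` string. The names `ParsedDoc` and `parseSketch` are assumptions made for the example.

```typescript
// Minimal stand-in for a Document: text plus arbitrary metadata.
interface ParsedDoc {
  pageContent: string;
  metadata: Record<string, unknown>;
}

// Shape of the extractor this sketch expects (mirrors mammoth.extractRawText).
type RawTextExtractor = (input: { buffer: Buffer }) => Promise<{ value: string }>;

async function parseSketch(
  raw: Buffer,
  metadata: Record<string, unknown>,
  extractRawText: RawTextExtractor
): Promise<ParsedDoc[]> {
  const { value } = await extractRawText({ buffer: raw });
  // Empty text yields no documents at all.
  if (!value) return [];
  // Otherwise, one document carrying the full text and the given metadata.
  return [{ pageContent: value, metadata }];
}
```

Injecting the extractor also makes the empty-text branch easy to exercise without a real DOCX file.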

  • Static method that dynamically imports the readFile function from the Node.js fs/promises module when it is needed. If the import fails, it throws an error indicating that fs/promises is not available in the current environment.

    Returns Promise<{
        readFile: {
            (path, options?): Promise<Buffer>;
            (path, options): Promise<string>;
            (path, options?): Promise<string | Buffer>;
        };
    }>

    Promise that resolves with an object containing the readFile function.
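
A minimal sketch of the dynamic-import pattern described above, assuming a Node.js runtime. The function name `importReadFile` and the error message are illustrative, not the library's own.

```typescript
// Hypothetical helper: lazily loads readFile so that importing this code
// does not fail in environments without fs/promises (for example, browsers).
async function importReadFile() {
  try {
    const { readFile } = await import("node:fs/promises");
    return { readFile };
  } catch (error) {
    // Illustrative message; the real loader's wording may differ.
    throw new Error("fs/promises is not available in the current environment.");
  }
}
```

Deferring the import to call time keeps the failure local to file-path loading, so Blob-based loading can still work where the file system is unavailable.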