rawDocumentFileType property

String? rawDocumentFileType
getter/setter pair

This is used when DocAI was not used to load the document and the inline_raw_document still needs to be parsed or have its content extracted.

For example, if inline_raw_document is the byte representation of a PDF file, then this should be set to: RAW_DOCUMENT_FILE_TYPE_PDF. Possible string values are:

  • "RAW_DOCUMENT_FILE_TYPE_UNSPECIFIED" : No raw document specified or it is non-parsable
  • "RAW_DOCUMENT_FILE_TYPE_PDF" : Adobe PDF format
  • "RAW_DOCUMENT_FILE_TYPE_DOCX" : Microsoft Word format
  • "RAW_DOCUMENT_FILE_TYPE_XLSX" : Microsoft Excel format
  • "RAW_DOCUMENT_FILE_TYPE_PPTX" : Microsoft PowerPoint format
  • "RAW_DOCUMENT_FILE_TYPE_TEXT" : UTF-8 encoded text format
  • "RAW_DOCUMENT_FILE_TYPE_TIFF" : TIFF or TIF image file format
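
As an illustration of choosing one of the values above, the following is a minimal sketch of a helper that derives the string from a file name's extension. The function name rawDocumentFileTypeFor and the extension-to-type mapping are assumptions made for this example; they are not part of the library, and the mapping of ".txt" to the text type presumes the file really is UTF-8 encoded.

```dart
// Hypothetical helper (not part of the library): returns one of the
// documented rawDocumentFileType strings based on a file name's extension.
String rawDocumentFileTypeFor(String fileName) {
  final dot = fileName.lastIndexOf('.');
  // No extension, or a trailing dot: fall back to the unspecified value.
  if (dot < 0 || dot == fileName.length - 1) {
    return 'RAW_DOCUMENT_FILE_TYPE_UNSPECIFIED';
  }
  // Compare extensions case-insensitively.
  switch (fileName.substring(dot + 1).toLowerCase()) {
    case 'pdf':
      return 'RAW_DOCUMENT_FILE_TYPE_PDF';
    case 'docx':
      return 'RAW_DOCUMENT_FILE_TYPE_DOCX';
    case 'xlsx':
      return 'RAW_DOCUMENT_FILE_TYPE_XLSX';
    case 'pptx':
      return 'RAW_DOCUMENT_FILE_TYPE_PPTX';
    case 'txt': // Assumption: .txt files are UTF-8 encoded text.
      return 'RAW_DOCUMENT_FILE_TYPE_TEXT';
    case 'tif':
    case 'tiff':
      return 'RAW_DOCUMENT_FILE_TYPE_TIFF';
    default:
      // Any other extension is treated as non-parsable.
      return 'RAW_DOCUMENT_FILE_TYPE_UNSPECIFIED';
  }
}
```

The returned string could then be assigned to rawDocumentFileType on the object that also carries the raw document bytes.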

Implementation

core.String? rawDocumentFileType;