Skip to content

Intelligent Document Parsing

Our AI-powered ComIDP solution intelligently processes and comprehends content in documents and images, including reports, contracts, essays, and other standard files. It identifies and classifies elements, modules, and structures while preserving the original reading logic. By structuring unstructured and semi-structured data, it provides accurate data sources for downstream applications.

Currently supported parsed types include: text, paragraphs, images, standard tables, non-standard tables, tables of contents, ordered lists, titles, unordered lists, formulas, headers, footers, page numbers, image titles, image descriptions, references, table titles, table descriptions, code, and algorithms.

Home

From the homepage, you can directly access the intelligent document parsing feature. The homepage is the center of ComIDP's feature navigation, providing quick access to commonly used modules.

parsing home

Single Document Parsing

Perform intelligent document parsing for a single document.

Select the file you want to parse. Document parsing will automatically begin after successful upload. You can switch between different result options to view the corresponding results. Currently supported options include: Markdown, JSON, TOC, Table, Image, and Text.

extract singleextract result

Batch Document Parsing

Batch parse up to 30 documents.
Click the Bulk button above to switch to batch parsing. Upload the file you want to parse, set the parsing parameters, click Start Parsing, and wait for a moment to see the results of the intelligent document parsing.

extract bulk

Introduction to Parsing JSON Fields

js
success_count: Number of successfully converted pages
total_count: Number of converted pages
version (string): Version number

pages (array): Page data container
├─ angle (integer): Text layout direction (0: horizontal / 90: vertical)
├─ page_id (number): Current page number
├─ image_id (string): Relative directory address of the image corresponding to the current page
├─ width (integer): Document page width (pixels)
├─ height (integer): Document page height (pixels)
├─ content (array): Basic data: Text lines, One of the images 
├─ id (integer): The reading order id of the current paragraph on the page. 
├─ score (integer): The score of the AI ​​layout analysis result, between 0 and 1, with values ​​closer to 1 representing greater credibility. 
├─ text (string): Text. 
├─ type (string): Type, paragraph (paragraph type, including text information such as body text, title, and formulas). 
├─ position (array): The coordinates of the four corner points of the table of contents area, in order: left-top, right-top, right-bottom, and left-bottom. 

catalog (object): Directory tree structure 
│ └─ toc (array) 
│ ├─ pos (array): The coordinates of the four corner points of the table of contents area, in order: left-top, right-top, right-bottom, and left-bottom. 
│ ├─ paragraph_id (integer): The paragraph ID of the title
│ ├─ page_id (integer): The page number of the title (minimum page number is 1)
│ ├─ content (string): The title content
│ └─ type (string): Title type: catalogue, title, figure_title, table_title

metrics (array): Page-level performance metrics
├─ page_image_width (integer): The current page rendering width (pixels)
├─ page_image_height (integer): The current page rendering height (pixels)
├─ page_id (number): The current page number
├─ angle (integer): The text layout direction (0: horizontal / 90: vertical)
└─ image_id (string): The relative directory address of the image corresponding to the current page (same as pages.image_id)

detail (array): Pagination data container
├─ page_id (integer): The page number of the current paragraph
├─ paragraph_id (integer): The id of the current paragraph
├─ text (string): Text. Currently empty if type is image or table
├─ type (string): Type: paragraph (paragraph type, including text information such as body, title, and formulas), image (image type), or table (table type)
├─ image_url (string): Image URL
├─ position (array): The coordinates of the four corner points of the directory area, in order: left-top, right-top, right-bottom, and left-bottom.
├─ tags (array): Indicates whether there is special text in the paragraph. Types include formula and handwritten (not currently supported, empty array, reserved for future use).
│─ cells (array): Cell array, returned only when type is table
│ ├─ row_span (integer): Cell row span, defaults to 1
│ ├─ text (integer): Cell text content
│ ├─ type (integer): Cell represents a table cell
│ ├─ col (integer): Cell column number
│ ├─ col_span (integer): Cell column span, defaults to 1
│ ├─ position (integer): Coordinates of the four corner points of the cell, left-top, right-top, right-bottom, left-bottom.
│ └─ row (integer): Cell row number

Details

TypeDescription
paragraphParagraph
figureFigure
standard_tableStandard table
unstandard_tableNonstandard table
ordered_listOrdered list
cataloguecatalogue
titleTitle
unordered_listUnordered list
formulaFormula
headerHeader
footerFooter
page_numberPage number
figure_titleFigure title
figure_captionFigure description
referenceReferences
table_titleTable title
table_captionTable description
codeCode
algorithmAlgorithm