Intelligent Document Parsing

Our AI-powered ComIDP solution intelligently processes and comprehends content in documents and images, including reports, contracts, essays, and other standard files. It identifies and classifies elements, modules, and structures while preserving the original reading logic. By structuring unstructured and semi-structured data, it provides accurate data sources for downstream applications.

Currently supported parsed types include: text, paragraphs, images, standard tables, non-standard tables, tables of contents, ordered lists, titles, unordered lists, formulas, headers, footers, page numbers, image titles, image descriptions, references, table titles, table descriptions, code, and algorithms.

Home

From the homepage, you can directly access the intelligent document parsing feature. The homepage is the center of ComIDP's feature navigation, providing quick access to commonly used modules.

parsing home

Single Document Parsing

Perform intelligent document parsing for a single document.

Select the file you want to parse. Document parsing will automatically begin after successful upload. You can switch between different result options to view the corresponding results. Currently supported options include: Markdown, JSON, TOC, Table, Image, and Text.

extract single extract result

Batch Document Parsing

Batch parse up to 30 documents.
Click the Bulk button above to switch to batch parsing. Upload the file you want to parse, set the parsing parameters, click Start Parsing, and wait for a moment to see the results of the intelligent document parsing.

extract bulk

Introduction to Parsing JSON Fields

◆ success_count: Number of successfully converted pages
◆ total_count: Number of converted pages
◆ version (string): Version number

◆ pages (array): Page data container
├─ angle (integer): Text layout direction (0: horizontal / 90: vertical)
├─ page_id (number): Current page number
├─ image_id (string): Relative directory address of the image corresponding to the current page
├─ width (integer): Document page width (pixels)
├─ height (integer): Document page height (pixels)
├─ content (array): Basic data: Text lines, One of the images 
├─ id (integer): The reading order id of the current paragraph on the page. 
├─ score (integer): The score of the AI layout analysis result, between 0 and 1, with values closer to 1 representing greater credibility. 
├─ text (string): Text. 
├─ type (string): Type, paragraph (paragraph type, including text information such as body text, title, and formulas). 
├─ position (array): The coordinates of the four corner points of the table of contents area, in order: left-top, right-top, right-bottom, and left-bottom. 

◆ catalog (object): Directory tree structure 
│ └─ toc (array) 
│ ├─ pos (array): The coordinates of the four corner points of the table of contents area, in order: left-top, right-top, right-bottom, and left-bottom. 
│ ├─ paragraph_id (integer): The paragraph ID of the title
│ ├─ page_id (integer): The page number of the title (minimum page number is 1)
│ ├─ content (string): The title content
│ └─ type (string): Title type: catalogue, title, figure_title, table_title

◆ metrics (array): Page-level performance metrics
├─ page_image_width (integer): The current page rendering width (pixels)
├─ page_image_height (integer): The current page rendering height (pixels)
├─ page_id (number): The current page number
├─ angle (integer): The text layout direction (0: horizontal / 90: vertical)
└─ image_id (string): The relative directory address of the image corresponding to the current page (same as pages.image_id)

◆ detail (array): Pagination data container
├─ page_id (integer): The page number of the current paragraph
├─ paragraph_id (integer): The id of the current paragraph
├─ text (string): Text. Currently empty if type is image or table
├─ type (string): Type: paragraph (paragraph type, including text information such as body, title, and formulas), image (image type), or table (table type)
├─ image_url (string): Image URL
├─ position (array): The coordinates of the four corner points of the directory area, in order: left-top, right-top, right-bottom, and left-bottom.
├─ tags (array): Indicates whether there is special text in the paragraph. Types include formula and handwritten (not currently supported, empty array, reserved for future use).
│─ cells (array): Cell array, returned only when type is table
│ ├─ row_span (integer): Cell row span, defaults to 1
│ ├─ text (integer): Cell text content
│ ├─ type (integer): Cell represents a table cell
│ ├─ col (integer): Cell column number
│ ├─ col_span (integer): Cell column span, defaults to 1
│ ├─ position (integer): Coordinates of the four corner points of the cell, left-top, right-top, right-bottom, left-bottom.
│ └─ row (integer): Cell row number

Details

Type	Description
paragraph	Paragraph
figure	Figure
standard_table	Standard table
unstandard_table	Nonstandard table
ordered_list	Ordered list
catalogue	catalogue
title	Title
unordered_list	Unordered list
formula	Formula
header	Header
footer	Footer
page_number	Page number
figure_title	Figure title
figure_caption	Figure description
reference	References
table_title	Table title
table_caption	Table description
code	Code
algorithm	Algorithm

Intelligent Document Parsing ​

Home ​

Single Document Parsing ​

Batch Document Parsing ​