Skip to content
Guides

Convert PDF to HTML

Overview

ComPDFKit Conversion SDK provides the PDF to HTML function, which can convert PDF files to HTML files while maintaining the layout and format of the original document, allowing users to browse and view the document on Web.

Note

When converting PDF to HTML format, ComPDFKit Conversion SDK provides the following four options to create HTML files:

OptionsDescription
HtmlOption.SinglePageConvert the entire PDF file into a single HTML file, where all PDF pages are connected in sequence according to page number, displayed on the same HTML page.
HtmlOption.SinglePageNavigationByBookmarksConvert the PDF file into a single HTML file with an outline for navigation at the beginning of the HTML page. Still, all PDF pages are connected in sequence according to page number, displayed on the same HTML page.
HtmlOption.MultiplePagesConvert the PDF file into multiple HTML files. Each HTML file corresponds to a PDF page, and users can navigate to the next HTML file via a link at the bottom of the HTML page.
HtmlOption.MultiplePagesSplitByBookmarksConvert the PDF file into multiple HTML files. Each HTML file corresponds to a PDF page, and users can navigate to the next HTML file via a link at the bottom of the HTML page. The links of all the HTML files are presented in an outline HTML file for navigation.

Sample

This sample demonstrates how to convert PDFs to HTML files.

python
options = ConvertOptions()
error_code = PDFToOffice.start_pdf_to_html("sample.pdf", "", "path/to/output", options, callback)

if error_code == ErrorCode.Success:
    print("Convert success")