Issues Addressed
・Fixed the issue of missing annotations.
・Fixed the crash issue with special documents during PDF to Excel conversion.
・Fixed the issue of garbled text when copying content from searchable PDFs.
・Fixed the crash issue with special documents during PDF to Word conversion.
・Fixed the issue with box layout effects when converting PDF to Word.
New Features
・Added support for converting images to Json.
Issues Addressed
・Optimize table recognition effect.
・Fixed the crash caused by some special hyperlink types documents.
・Optimize the problem of image layer loss.
・Fixed the problem that the source file resources occupied when converting PDF to searchable PDF are not released.
・Added support for converting scanned PDF files to searchable PDF files.
・Added support for converting images to Word, Excel, PPT, HTML, TXT, CSV, and RTF.
・Added support for table recognition in the flowing text layout when converting PDF to Word.
・Added support for selecting pages when extracting data from PDFs.
・Added support for canceling conversions when extracting data from PDFs.
・Added support for extracting text, images, and tables simultaneously in the PDF data extraction function.
・Added support for performing OCR when converting PDF to CSV.
・Optimized the conversion effect when converting PDF to Excel and outputting the entire PDF document content in one worksheet.
・Fixed the issue of conversion failure when the incoming file path contains the '%' symbol.
・Added support for keeping the PDF annotations like highlight, underline, strikeout, and squiggly when converting PDF to Word, PPT, and HTML.
・Added support for keeping the hyperlinks in PDFs when converting PDF to Word, PPT, and HTML.
・Added support for keeping the PDF annotations like underline and strikeout when converting PDF to Excel (Only takes effect in extracting text).
・Added an API to extract the tables in a PDF file to JSON.
・Added an API to extract the text in a PDF file to JSON.
・Optimized the PDF to Excel OCR effect: Fix the issue where large images could cover the document.
・Fixed the issue that the default parameter would not be used when a null value passed in during converting.
・Added an API to set the file path of the OCR model: (void)setOCRModelPath:(NSString *)modePath
.
・Fixed the issue that table recognition would output the negative coordinate data.
・Fixed the issue that table recognition could not properly release memory.
・Optimized the process logic of table recognition. If failed to recognize the standard tables, the non-standard table algorithm will be used for recognition again to improve table recognition success rate.
More Information
・Guides for OCR
・Added support for recognizing tables when performing OCR on files.
・Fixed issues when converting PDF to HTML that can't jump to specific outlines in some cases and comment loss.
・Fixed the issue that setting negative dpi values would cause a crash when converting PDF to JPG.
・Optimized the slow response issue when clicking cancel when converting PDF to RTF.
・Optimized the accuracy of OCR.
・Optimized the capacity of the OpenCV library on the Mac platform.
・Optimized converting PDFs to HTML files with smaller file sizes.
・Added support for Retain Flowing Text when converting PDF to Word. There are two layout options to choose from: Retain Flowing Text and Retain Page Layout. Retain Flowing Text is more convenient for overall document editing than the Retain Page Layout.
・Added support for a factory method for instantiating conversion classes, making it easier to instantiate other conversion classes through factory methods.
・Added support for multiple languages in OCR (Specific languages are listed in the documentation).
・Fixed an issue that the DPI field was missing after setting the DPI when converting PDF to JPG.
・Added support for OCR.
・Added support for setting whether to contain images or annotations when converting.
・Added support for setting DPI when converting PDF to image.
・Added support for merging multiple tables into one file when converting PDF to CSV.
・Fixed an issue that the scanned PDF cannot be converted when converting PDF to Excel.
・Added support for converting PDF to Rich Text Format documents, including text and images.
・Added support for converting PDF to HTML, including single-page and multiple-page.
・Added support for tables when converting PDF to PPT.
・Added support for images when converting PDF to XLSX.
・Added support for text alignment when converting PDF to XLSX.
・Added support for converting all tables on one page or all tables in the entire document to one sheet when converting PDF to XLSX.
・Added support for non-table content when converting PDF to XLSX.
・Added support for converting PDF to high-quality image formats, including PNG and JPEG. All image quality and resolution will remain intact.
・Added error messages for conversion failure.
・Added support for arm64 architecture of the Apple M1 chip.
・Fixed an issue with being incompatible with WPS.
・Fixed an issue with the mismatch of a part of fonts.
・Optimized performance.
・Added support for extracting only tables from PDF accurately and converting them to XLSX, and one table is converted to one sheet.
・Added support for extracting only tables from PDF accurately and converting them to CSV, and one table is converted to one CSV file.
・Optimized performance.
・Added support for table recognition when converting PDF to Word, completely preserving original content, structure, and style.
・Optimized performance.
This is the first release of ComPDFKit Conversion SDK, it is a new Rapid Development Kit for iOS/Android/Windows/Mac platforms which focuses on helping developers rapidly integrate powerful ComPDFKit Conversion technology into their own apps.
ComPDFKit Conversion SDK provides an Objective-C API (compatible with Swift) for iOS/Mac and its features include:
Convert PDF documents to Word documents.
・Reconstruct content from PDF files into reusable data.
・Reconstruction (recover page layout, columns, formatting, graphics, and preserve text flow).
・Support mapping conversion to split-column function.
・Support the function of combining letters into a string of text.
・Support floating insertion such as picture rotation cutting.
・Support converting partial Math or Chemistry function.
・Support font size, color, bold, italic, and underline recognition.
Convert PDF documents to Excel documents.
・Support data mapping to Excel cells.
・Support recognition of a small number, currency, and other formats.
・Support multiple cells merge function.
Convert PDF documents to PPT documents.
・Convert each page to an editable slide.
・Support converting text to a PowerPoint text box.
・Support picture rotation cutting and other floating insertion.
・Support converting partial Math or Chemistry function.