Updates

ComPDFKit Conversion SDK 1.6.0

By ComPDFKit | Fri. 17 May. 2024
ReleaseOCRConversion SDK

We are delighted to announce the release of ComPDFKit Conversion SDK 1.6.0 for Mac, iOS, Android, Windows, and Server! In this version, we have added a new feature on Mac and Windows platforms: optical character recognition (OCR). With OCR, you’ll be able to save time, cost, and other resources by utilizing automated data extraction and storage capabilities. 

 

Additional new features added in this release include:

         - Support merging multiple tables into one file when converting PDF to CSV.

         - Support for setting DPI when converting PDF to image.

         - Support for setting whether to contain images or annotations when converting.

 

For a full list of changes, please refer to our ComPDFKit Conversion SDK changelog.

 

 

What Is OCR

 

Optical Character Recognition (OCR) is the technology that converts an image and a scanned file into a computer-readable text format. When you have an image file or a scanned file and want to extract the text for editing, you can use OCR to extract text and store the text data, and then edit the text.

 

 

ComPDFKit OCR Technology

 

When working with PDF files, you may encounter inaccessible scanned PDF files. In this case, you can use our OCR component on Mac and Windows platforms to extract text to create editable PDF files, and then use annotation, selection, search, and other functions.

 

Our OCR technology can extract text from handwritten documents and letters, allowing for a more efficient and accurate way to digitize this information without needing to manually input the text.

 

 

Convert Multiple PDF Tables into a CSV File

 

When converting files, you may encounter a PDF file that contains multiple forms. In this case, you can use ComPDFKit to convert the forms in this PDF file into one CSV file. With this operation, you do not need to manually adjust, which is convenient for viewing.

 

Here we take Kotlin as an example.

val cPDFConvert = CPDFConverterCsv(context, uri, "")

val params = CPDFConvertCsvOptions().apply {
    isMergeCsv = true
}

val result: ConvertError = cPDFConvert.convert(outputDir, outputFilenameNoSuffix, params, pageArrays, 
onHandle = onHandleCal, 
onProgress = onProgressCal, 
onPost = onPostCal)



Introduction to DPI

 

Dots per inch (DPI) is a measure of video or image scanner dot density. The higher the number of DPI, the better-quality image you’ll see.

 

ComPDFKit provides DPI settings for you, you can choose the DPI value you need when converting PDF to images. Setting the DPI can also optimize the printing experience. Since some PDF files contain images, the DPI of the image is usually higher than the DPI printed by the printer, which may cause slow printing speed. In this case, the DPI setting can help you solve the problem.

 

Here we take Kotlin as an example.

val cPDFConvert = CPDFConverterImg(context, uri, "")

val params = CPDFConvertImgOptions().apply {
     imageDpi = true
     imgType = ImgType.JPEG
     isContainAnnotations = true
}

val result: ConvertError = cPDFConvert.convert(outputDir, outputFilenameNoSuffix, params, pageArrays, 
onHandle = onHandleCal, 
onProgress = onProgressCal, 
onPost = onPostCal)

 

 

Whether to Export Annotations or Images When Converting

 

If you make annotations on a PDF file when browsing it, you may want to export the annotations along with it. Our new function allows you to choose whether to export annotations during conversion, and you can also choose whether to export images.

 

Here we take Kotlin as an example.

val cPDFConvert = CPDFConverterWord(context, uri, "")

val params = CPDFConvertWordOptions().apply {
     isContainImages = true
     isContainAnnotations = true
}

val result: ConvertError = cPDFConvert.convert(outputDir, outputFilenameNoSuffix, params, pageArrays, 
onHandle = onHandleCal, 
onProgress = onProgressCal, 
onPost = onPostCal)

 

 

Optimization

 

We have optimized the function of converting PDF to Excel. Expand the range of PDF file types that can be converted. Currently, ComPDFKit can support the conversion of scanned PDF files to Excel files.

 

 

Conclusion


In this blog, we introduced the updated version and the newly optimized function. You are welcome to your trial of our ComPDFKit, and you can also access our official website and give us your feedback. We will continue to optimize to provide you with the best Conversion SDK.