What is it all about?
PDF Extractor SDK converts PDF to text, PDF to XML, PDF to CSV, extracts images from PDF, extracts information about PDF files in .NET and ActiveX interfaces.
* converts PDF to plain text (including invisible text extraction);
* converts tables in PDF to CSV (Excel) and XML files; extracts PDF file metadata and information;
* extracts embedded images from PDF document; .NET and ActiveX interfaces;
* 100% managed C# code.
Video & Images
* Advanced text search with support for regular expressions and more. * Image to text extraction – convert OCR in PDF to text. * Repair damaged text when PDF shows correct text but copies damaged text. * Extract PDF file author, title, description and other metadata. * Extract and convert tables from PDF to CSV or XML. * Merge or split document for easier management, extract text from pdf c#. * Extract embedded images from PDF .NET (2.00 to 4.50) and ActiveX interfaces emulation, C# extract image from PDF In this version 220.127.116.1179 (July 18, 2018) * Greatly improved the line removing OCR image preprocessing filters. * SearchablePDFMaker: fixed hanging on processing PDF documents with a large count of vector objects. * Fixed bug in RotationAngle property when processing already rotated PDF documents. * Added new OCRAnalyzer class that can help to find an optimal combination of OCR image preprocessing filters. See source code examples. * Other minor bug fixes and improvements.
|For more details:||https://secure.bytescout.com|