Scanning Text & Graphics for PDF Conversion

PDF Formatted Text and Graphics

PDF Formatted Text and Graphics (formerly known as PDF Normal) is an exact electronic representation of the document. This format preserves the document's original fonts and formatting. It is the highest-quality PDF format: clearer, smaller, and more versatile than the other two types (Image and Image + Searchable Text). It offers full functionality for viewing, linking, searching, and navigating.

SunTec converts hardcopy documents to PDF formatted text and graphics by scanning the pages, running OCR on the text, and distilling the results to PDF. We also convert electronic files into PDF formatted text and graphics files. This format is the best PDF option, but it is also the hardest and most complicated to produce.



This PDF formatted text and graphics conversion is, in effect, a re-authoring or regeneration of the document, with the following results:

  • All copy (including in-diagram copy) converted to formatted text
  • All graphic elements precisely preserved
  • Page layout replicated
  • Text accuracy up to 99.995%
  • Colored text and shaded boxes reproduced
  • Color and grayscale images are optimized for clarity and minimal file size before reassembly on the page
  • Fonts and paragraph attributes retained
  • Where possible, vector graphics are generated to replace bitmap graphics for a smaller file size and the cleanest-possible appearance, as well as enhanced screen and print performance
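
To put the accuracy figure above in concrete terms, the short sketch below converts an accuracy percentage into an expected number of character errors. The 2,000-characters-per-page figure is an assumption chosen for illustration, not a number from SunTec, and "up to 99.995%" is treated here as the best case.

```python
from fractions import Fraction


def expected_errors(char_count: int, accuracy_percent: str = "99.995") -> Fraction:
    """Expected number of wrong characters at a given text accuracy.

    accuracy_percent is passed as a string so the arithmetic stays exact
    (no binary floating-point rounding).
    """
    error_rate = 1 - Fraction(accuracy_percent) / 100
    return char_count * error_rate


# Assumption for illustration only: a page holds about 2,000 characters.
ASSUMED_CHARS_PER_PAGE = 2_000

if __name__ == "__main__":
    per_page = expected_errors(ASSUMED_CHARS_PER_PAGE)
    print(f"Expected errors per page: {float(per_page)}")
    print(f"Characters per expected error: {1 / expected_errors(1)}")
```

At the best-case accuracy this works out to one expected error per 20,000 characters, or roughly one error every ten pages under the assumed page length.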

 

Features of this conversion process include:

  • Small files: 4-6 KB per page for simple documents
  • Ideal for Web applications, especially where slow connection speeds are a factor
  • Excellent print performance
  • Brochure-grade conversions for high-value documents, or where the smallest possible file size is crucial


PDF File Type Comparison

|                     | Image | Image + Searchable Text | PDF Normal (Formatted Text & Graphics) |
|---------------------|-------|-------------------------|----------------------------------------|
| Accuracy            | Very high (page is retained as image) | Very high (page is retained as image) | High (in effect, re-authoring the document) |
| Text searchability  | No | Yes | Yes |
| File size           | Large (typically 40-50 KB at 300 dpi without grayscale or color images) | Large (typically 50-60 KB at 300 dpi without grayscale or color images) | Small (typically 4-6 KB per page for simple documents) |
| Typical application | Budget-friendly archiving | Full-text search for bitonal files | Tiny but rich files, great for the web |
| Cost                | Low | Medium | High |
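
The file-size row can be turned into a rough estimate of total document size and download time. The sketch below is illustrative only: it assumes the quoted sizes are per page, uses the midpoint of each range, and treats 1 KB as 1,000 bytes. The function names and the 56 kbit/s link speed are hypothetical choices, not part of SunTec's process.

```python
# Midpoints of the "typical" ranges in the comparison (assumed to be per page).
TYPICAL_KB_PER_PAGE = {
    "image": 45,            # 40-50 KB
    "image_plus_text": 55,  # 50-60 KB
    "pdf_normal": 5,        # 4-6 KB
}


def document_size_kb(pages: int, pdf_type: str) -> int:
    """Rough total size of a document in KB for the given PDF type."""
    return pages * TYPICAL_KB_PER_PAGE[pdf_type]


def download_seconds(size_kb: float, link_kbit_per_s: float) -> float:
    """Idealized transfer time: 1 KB is treated as 8 kbit, no overhead."""
    return size_kb * 8 / link_kbit_per_s


if __name__ == "__main__":
    pages = 100
    slow_link = 56  # kbit/s, a hypothetical slow connection
    for pdf_type in TYPICAL_KB_PER_PAGE:
        size = document_size_kb(pages, pdf_type)
        seconds = download_seconds(size, slow_link)
        print(f"{pdf_type}: {size} KB, about {seconds:.0f} s at {slow_link} kbit/s")
```

For a 100-page document these midpoints give about 4,500 KB as Image versus about 500 KB as PDF Normal, a ninefold difference under the stated assumptions.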
Issues when choosing the type of PDF
  • Bandwidth
  • Text Searchability
  • Color or half-tone images
  • Document size
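
One way to read these issues against the comparison of the three PDF types is as a simple decision rule. The sketch below is a hypothetical heuristic, not SunTec's selection procedure: the function name, its parameters, and the 500-page threshold for a "large" document are all assumptions made for illustration.

```python
def suggest_pdf_type(
    needs_text_search: bool,
    low_bandwidth: bool,
    has_color_or_halftone: bool,
    page_count: int,
    large_doc_pages: int = 500,  # hypothetical cut-off for a "large" document
) -> str:
    """Suggest a PDF type from the four issues listed above (illustrative only)."""
    # Bandwidth and document size: the smallest files come from PDF Normal.
    if low_bandwidth or page_count >= large_doc_pages:
        return "PDF Normal (Formatted Text & Graphics)"
    if needs_text_search:
        # Image + Searchable Text is described as suited to bitonal files,
        # so color or half-tone content falls through to PDF Normal here.
        if has_color_or_halftone:
            return "PDF Normal (Formatted Text & Graphics)"
        return "Image + Searchable Text"
    # No search or size pressure: the lowest-cost option for archiving.
    return "Image"


if __name__ == "__main__":
    print(suggest_pdf_type(needs_text_search=True, low_bandwidth=False,
                           has_color_or_halftone=False, page_count=40))
```

Cost is deliberately left out of the rule. Since PDF Normal is the most expensive option, a real decision would weigh its price against the size and search benefits.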

SunTec Web Services Pvt. Ltd.

Floor 3, Vardhman Times Plaza
Plot 13, DDA Community Center
Road 44, Pitampura
New Delhi - 110 034, INDIA

Phone : +91 11 4264 4425
               +91 11 4264 4426
               +91 11 4264 4427
               +91 11 4264 4428
               +91 11 4264 4429

Fax(India): +91 11 4264 4430
Fax (US) : +1 646 365 3077

Email : info@data-entry-india.com