Software to Convert Scanned PDF to Editable Text

Click For Summary

Discussion Overview

The discussion centers around the challenge of converting scanned PDF files, which contain images of text, into editable text formats. Participants explore software options and techniques, particularly focusing on Optical Character Recognition (OCR) technology.

Discussion Character

  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant seeks software that can convert a scanned PDF (text mode) into an editable format, specifically mentioning issues with modifying data in the converted document.
  • Another participant suggests that OCR technology is necessary for converting image-based PDFs into text files.
  • A third participant reiterates the need for OCR, emphasizing that bitmap images cannot be directly converted back to text without this technology.
  • A later reply expresses frustration with the OCR results, noting that the converted text contains unwanted dots after each word, making it difficult to edit.

Areas of Agreement / Disagreement

Participants generally agree on the necessity of OCR for the conversion process, but there is disagreement regarding the effectiveness of the OCR results, particularly concerning the presence of extraneous characters in the output.

Contextual Notes

Limitations include the potential variability in OCR software performance and the specific characteristics of the scanned documents, which may affect conversion quality.

paramahamsa
Messages
14
Reaction score
0
Can anybody tell me the software name which changes a pdf file(text mode) which has been scanned and kept in pdf format

actually i want to modify the data in that image

ie one textbook has been scanned and that is put in pdf format
I changed it into word with the converters(pdf to word) but even i can't able to modify the data in that.because it is coming in image form only


i want the software for that in internet
 
Computer science news on Phys.org
The only thing I've ever heard of similar to this is OCR when you scan an image to a text file. There may very well be something that converts a "picture" .pdf file back to a text format.
 
You'll need OCR as said.

If it's I am the form of a bitmap/image, you can't convert it back directly.
 
thank you all
i tried that also
but all the text that has converted is having dots after each worr


looks irritating to see i can't modify that dot by deleating it
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 8 ·
Replies
8
Views
8K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
9K