
The cheap/low budget ones treat each line of text as a separate entity, usually putting the text into a text box to control the location of the text. You want to look for an OCR tool that treats paragraphs as a single unit. Instead, find a fully functional OCR program that keeps formatting on extracted text. Once you recreate the formatting after the first paste you can use Copy-Formatting and Paste-Formatting to put it onto subsequently pasted text.Īnother option is to not use Office to extract the text, both Word and OneNote only have very primitive text-only "OCR" functions.
#Reveal text formatting in word pdf
If you are able to copy text directly from the PDF the text tends to be unformatted text only. The fundamental problem is that PDF is an image based file format. You can also try using Paste Special,, there are more options to preserve formatting. Obvious hint: make sure to keep the newly recreated DOC files for future editing. Obvious question: don't you have access to the original document files? pdfs in question were created using the previous versions of Word, some as recently as 2015 and none older than 2011. So what are weĭoing wrong? What option are we missing? What alternatives do we have? I tried changing Pasting between documents when styles conflict to "Use Destination Styles (Default)" but that didn't work, either.Īll that myself and my staff can figure out is to manually change the formatting every time we copy and paste - which is of course tedious and awkward as sometimes we're working with multiple standards from multiple ordinances and standards. Problem is, I don't understand where I am supposed to add that option. Help also tells me to go to File > Options > Advanced > Cut,Ĭopy, and Paste and add that option if I don't see it. Word Help tells me to select the "matchĭestination formatting" option, which I don't have when I hit CTRL+V - I only see, "Keep Source Formatting", "Merge Formatting", and "Keep Text Only" - no option to match destination formatting. I need it to match the formatting of the paragraph in the bottom (blue) box. The paragraph in the top box was pasted into a staff report from our historic preservation standards. Pages says should happen doesn't happen in reality. Invariably, matching the formatting must be done manually because what the Microsoft help pdf version of one document (case in point, our city's historic preservation standards) and paste into staff reports. At my work, we are often required to refer to a.
