--dc--adobecom.hlx.page/dc-shared/assets/images/shared-images/frictionless/seo-icons/word-pdf-converting.svg An Acrobat PDF document and text document with arrows showing how you can convert a PDF to Microsoft Word
--dc--adobecom.hlx.page/dc-shared/assets/images/shared-images/frictionless/seo-icons/download-and-share.svg A download arrow with a small cloud showing that you can easily download your converted file
As the inventor of the PDF file format, Adobe makes sure our Acrobat PDF to Word converter preserves your document formatting. When you use our online conversion tool, your fonts, images and alignments will look as expected. The converted file is an editable Word document that you can start using straightaway in Microsoft Word online.
You can also try Adobe Acrobat Pro free for seven days to convert files to and from Microsoft 365, edit PDF documents with PDF editor tools, edit scanned documents using optical character recognition (OCR) functionality, merge PDFs, organise or rotate PDF pages, split PDFs, reduce file size and convert HTML, TXT, RTF, PNG, JPG, BMP and other formats to PDF.
I'm asking about this in StackExchange Academia rather than a software-recommendation site because I have conversion problems specific to the layout of academic publications. I'm helping a student who has problems with their eyesight. They find it hard to read academic papers and books on screen, when distributed as PDF. They prefer to have these converted to Word, then read that on screen. This helps them because they can change the font of any part that's hard to read, as well as highlighting section headings and other cues to help them navigate. There are many PDF-to-Word converters, but all the ones I've tried (see below) have defects. Can anyone recommend something better than those I've listed? The converter must not mangle tables, diagrams, footnotes, formulae, subscripts and superscripts, or other complicated content typical of academic publications.
The student does not like to read printed text, and only has an A4 printer, so blowing up the text and printing that isn't feasible. The pandemic introduces its own constraints, since they don't want to venture from their flat to use University computers. So they're restricted to what I think is a 22-inch screen, Windows 10, and their A4 printer. I only mention these because commenters suggested them. The student is comfortable reading Word files, and I want to help them do that, rather than force them to do something outside their preferences.
Kofax Power PDF, but it's very erratic, though. For example, when I converted an archaeological report which I had to ask Power PDF to OCR, it "thought" it had found characters in the thin lines indicating boundaries in a diagram of the archaeological site. Power PDF also mangles structural chemical formulae, even when told not to OCR. Footnotes and tables cause even more problems, as do in-line chemical formulae, and subscripts and superscripts.
SmallPDF. This claims to be "the platform that makes it super easy to convert and edit all your PDF files". It is not. It converts some so they work better than those done by Power PDF, but crashes on others.
PDF24. This probably does a better job than either Power PDF or SmallPDF, when it works. But on some PDFs that need OCR'ing, it just emits the original page-images. This may well be a bug, and I've reported it, but I've not had any reply.
Have you tried the obvious of either exporting the pdf from Adobe Acrobat importing the pdf into Word. I tried just now with two papers and the results are really good with regards to figures, tables, footnotes, references, layout, ...
Make sure to download the documents instead of viewing them in the Google drive. They will look closer to the original in Word itself than rendered by Google drive. The only major issue I can see is the title page in both cases--but the PDF export seems to do a slightly better job.
I'll start by disclaiming: My initial solutions do not follow the prescribed pdf to doc conversion (whilst my last solution does, albeit, it's probably a bad solution), and I haven't tested them, which violates the OP's constraints. I disregard these constraints because I don't think they are particularly relevant.
The trivial, obvious solutions, which have surely been tried, are: Large screen plus zoom and printing to a larger paper size, e.g., A3 or A2 (the former being more commonly available), rather than A4. Annotations and highlighting can be applied to pdfs with numerous tools.
For various other enhancements, numerous tools exist on Linux, which can be scripted to suit needs. For instance, a page could be split into smaller parts and manipulated in various ways, e.g., split a portrait page into top- and bottom-parts, and blow up each part onto a horizontal page (optionally of larger size). All operations can be automated. This approach cannot fail particularly badly for academic papers. At worst, perhaps the split is applied in the wrong place, but, the two parts can be displayed simultaneously, aligned at the lower/upper edge, or by folding the top edge of a printed page.
Linux tools can also be used to convert to word processor format, for instance, pdf2ps and ps2txt can be combined to convert pdf to plaintext, which any word processor can open. (This is my worst solution. I doubt tables, footnotes, formulae, etc. will be handled particularly well.)
Converting to a word processor format seems like a misdirection. That's why my last solution probably won't work. Extracting another format from a pdf seemingly goes against the design goal of the pdf format, namely, to present documents in a [software independent] manner...encapsulat[ing, in each file,] a complete description of a fixed-layout flat document, including the text, fonts, vector graphics, raster images and other information needed to display it (source: Wikipedia).
When I teach my students about this, I tell them that when someone asks if they can get data out of a PDF document, the first answer should be "No." The second answer, if pressed, should be "No. Is there any other way to get it?" And if still pressed, the final answer is "Fine, but it won't be very reliable."
This is especially true with OCR. The software is using algorithms, often Google's Tesseract engine, to try and give its best guess what it's seeing. There will nearly always be errors, especially if the scan may not be the highest quality. You're going to see spacing messed up a lot, but also close values, like confusing a 1 and a 7, or a 0 and an O. Tables are especially prone to being butchered. If you upload any PDF to Google Docs, it gives you the option to convert it to text in the menu. You can also download Tesseract hooks for Python, for example.
Honestly, even non-OCR PDF documents can be unreliable. Have you ever copy-pasted text out of a PDF, and noticed that some of the spaces disappear? Well, software converting it to text encounters problems as well.
The PDF specification isn't a plain text document the way Word files are. For example, a trick not many people know is that you can unzip a .docx file into its component parts using regular unzip software, and one of those parts is a text file. This is not at all true with a PDF document.
Well, there are many ways to do this. One way to do this is online converter such as Online2PDF and Smallpdf. You can also do this by using Microsoft word.
I hope it will help you to do this.
Thanks
Hi @fl35343 ,
I have not heard back from you in a few days and wanted to check on the status of the problem. If you have any questions with the previous information I've provided, please don't hesitate to let me know. I am glad to be of assistance.
Hi @fl35343 ,
If you want to convert PDF file to Word document, you can refer to this support article:
-to/pdf-to-word-doc-converter.html
Note: Microsoft is providing this information as a convenience to you. The sites are not controlled by Microsoft. Microsoft cannot make any representations regarding the quality, safety, or suitability of any software or information found there. Please make sure that you completely understand the risk before retrieving any suggestions from the above link.
If the response is helpful, please click "Accept Answer" and upvote it.
Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.
It's very easy, you can download PDF converter software like Adobe or PDF-XChange to convert PDF to Word. If you don't want to download apps, you can also use online pdf tools to convert it, like smallpdf, ilovepdf, compdf. Just directly upload your PDF files, then convert it to Word, and download the Word files.
Another proven way to convert PDF to Word on Windows is using the popular Adobe Acrobat. It is one of the best PDF to Word converters that works on Windows PC and Mac. To convert a PDF to a Word document using Adobe Acrobat on a Windows computer, follow these steps:
Step 1: Start by opening Adobe Acrobat. If you don't have it installed, you can download and install it from the Adobe website. Adobe Acrobat Reader is free, but to convert PDFs to Word documents, you will need Adobe Acrobat Pro, which is a paid version.
Ah, gotcha! So, I stumbled upon this neat little tool called "UniPDF." As a best PDF to word converter for Windows computer, UniPDF is pretty straightforward to use. You just drag and drop your PDF files into the interface, select Word as the output format, and hit convert button to conver PDF to word on Windows computer. What I love about it is that it manages to maintain most of the formatting pretty darn well, even with complex layouts and funky fonts.
Once the conversion is complete, navigate to the output folder you selected earlier. You'll find all the converted Word documents there. Open them in Microsoft Word to review and ensure that the formatting has been preserved to your satisfaction.
d3342ee215