text-processing - Proper way to convert PDF to word from bash command-line

Problem

I need to convert 1K pdf files to doc on a debian server. I can convert a PDF to word using libreoffice commandline: libreoffice --headless --invisible --convert-to doc Sample-doc-file-100kb.pdf

Solution

The main problem with the above two commands, is that the doc file doesn't include images in the pages, it only contains the formatted text. Is there a better way to convert pdf to doc, including also the images present in the pdf? I am not interested in web services like zamzam, I need to do that from command-line on the server.

One possible solution

I managed to do it by using this: libreoffice --infilter=="writer_pdf_import" --headless \ --convert-to doc:"writer_pdf_Export" Brief.pdf This solution gives me the same output as @igiannak's answer.

Another possible solution

I tried converting the PDF to HTML and then to doc, but I encountered a problem with the resultant doc file being detected as a pdf and libreoffice opening it in Draw.

New Solution

any direct command line interface command is available with pdf to docx conversion including images present in the pdf and I tried libreoofice and soffice commands it was giving only simple formatted text like any other pywin32 com clinet library is available on linux/ubuntu during pdf to word conversion

import os
import sys
import comtypes.client
wdFormatPDF = 17
def covx_to_pdf(infile, outfile):
    """Convert a Word .docx to PDF"""
    word = comtypes.client.CreateObject('Word.Application')
    doc = word.Documents.Open(infile)
    doc.SaveAs(outfile, FileFormat=wdFormatPDF)
    doc.Close()
    word.Quit()

But this package can not support to linux/debian platforms.

Can we have any suggestion for this same implementation on Linux/debian for pdf to word conversion?

. . .

AI Text Generator | AI-Powered

Empower your content creation with our Text Generator. Whether it's for websites, projects, or creative endeavors, effortlessly generate text that suits ...

Chorme//flags# enable to erlounc - Google Chrome Community

Oct 5, 2019 ... Chorme//flags# enable to erlounc ... This question is locked and replying has been disabled. ... Community content may not be verified or up-to-date ...

Free AMA citation generator [2024 Update] - BibGuru

Create AMA citations and reference lists in seconds with our easy-to-use citation generator. Accurately reference books, journals, websites and much more in ...

9 Instagram analytics tools for better results in 2024

Tracking your Instagram analytics is the only way to build an effective Instagram strategy. If you're not tracking data, you're just guessing about what works.

LastPass - Sign In

LastPass is an online password manager and form filler that makes web browsing easier and more secure.