cm0002@lemmy.world to Programmer Humor@programming.dev · 2 days agoDOGE employeelemmy.worldexternal-linkmessage-square99fedilinkarrow-up1557arrow-down111
arrow-up1546arrow-down1external-linkDOGE employeelemmy.worldcm0002@lemmy.world to Programmer Humor@programming.dev · 2 days agomessage-square99fedilink
minus-squarelime!@feddit.nulinkfedilinkEnglisharrow-up15arrow-down1·edit-21 day ago$ pandoc doc.pdf -o doc.txt Edit: welp, pandoc can’t do that. pdftotext it is.
minus-squaremexicancartel@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up2·edit-21 day agomagick file.jpg file.html Imagemagick be converting anything into anything (Actually in this case, it make an html file and a png file which is referenced in html file and html page displays it)
minus-squarelime!@feddit.nulinkfedilinkEnglisharrow-up2·1 day agonot really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
minus-squarestetech@lemmy.worldlinkfedilinkarrow-up1·2 days agoI thought pandoc didn’t support from PDF, only to?!
minus-squarelime!@feddit.nulinkfedilinkEnglisharrow-up2·1 day agodamn it, you’re right. should probably have checked that…
minus-squarestetech@lemmy.worldlinkfedilinkarrow-up1·1 day agoDon’t worry, I didn’t know either and had to check to check too :P
$ pandoc doc.pdf -o doc.txt
Edit: welp, pandoc can’t do that.
pdftotext
it is.Imagemagick be converting anything into anything (Actually in this case, it make an html file and a png file which is referenced in html file and html page displays it)
not really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
I thought pandoc didn’t support from PDF, only to?!
damn it, you’re right. should probably have checked that…
Don’t worry, I didn’t know either and had to check to check too :P