[Linux] converting PDF to DOC?

Adam Glass linux@flux.org
Fri, 22 Jun 2007 13:04:59 -0400


------=_Part_4569_3498836.1182531899414
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

I don't know.  I've never used it either, just recall getting a scanner
years ago and it included OCR software.

--Adam


On 6/22/07, Robert Citek <robert.citek@gmail.com> wrote:
>
> On 06/22/2007 10:32 AM, Adam Glass wrote:
> > You could print the PDF and then scan the pages and then run the OCR
> > software.
>
> Would it be possible to directly convert the PDFs to whatever image
> format the OCR is expecting?  For example, using convert:
>
> $ convert f1040.pdf f1040.bmp
>
> and then load those bitmap files into the OCR.
>
> Just a thought.  I've never done OCR.
>
> Regards,
> - Robert
>
> _______________________________________________
> Linux mailing list
> Linux@flux.org
> http://www.flux.org/mailman/listinfo/linux
>

------=_Part_4569_3498836.1182531899414
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

I don&#39;t know. &nbsp;I&#39;ve never used it either, just recall getting a scanner years ago and it included OCR software.<div><br>&nbsp;</div><div>--Adam</div><div><br><br><div><span class="gmail_quote">
On 6/22/07, <b class="gmail_sendername">Robert Citek</b> &lt;<a href="mailto:robert.citek@gmail.com" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">robert.citek@gmail.com</a>&gt; wrote:</span><blockquote class="gmail_quote" style="margin:0;margin-left:0.8ex;border-left:1px #ccc solid;padding-left:1ex">

On 06/22/2007 10:32 AM, Adam Glass wrote:<br>&gt; You could print the PDF and then scan the pages and then run the OCR<br>&gt; software.<br><br>Would it be possible to directly convert the PDFs to whatever image<br>format the OCR is expecting?&nbsp;&nbsp;For example, using convert:
<br><br>$ convert f1040.pdf f1040.bmp<br><br>and then load those bitmap files into the OCR.<br><br>Just a thought.&nbsp;&nbsp;I&#39;ve never done OCR.<br><br>Regards,<br>- Robert<br><br>_______________________________________________
<br>Linux mailing list<br><a href="mailto:Linux@flux.org" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">Linux@flux.org</a><br><a href="http://www.flux.org/mailman" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
http://www.flux.org/mailman</a>/listinfo/linux<br></blockquote></div><br></div>

------=_Part_4569_3498836.1182531899414--