Author Topic: Converting scanned text files, help needed  (Read 813 times)

Offline ainslie

  • RootsChat Aristocrat
  • ******
  • Posts: 2,768
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Converting scanned text files, help needed
« on: Monday 04 February 13 15:10 GMT (UK) »
Through my failure to back up, or losing backed-up files, in a PC collapse, I lost lots of Word files.  Some had been printed out, and using a new printer/scanner (HP Deskjet 3520) I have started re-creating files to store in my computer.  So far I have managed to scan individual pages and saved them as PDF files.  I had the option of saving them as Photo/JPEG but thought I should be able to reunite the pages into one editable document, using PDF.
So far, no success, but Adobe offered a pop-up, paid-for option of a conversion system.
Is that necessary?
Any help would be welcome.
Thanks in advance.  I'm confident of good advice being offered!
A


Offline alanmack

  • RootsChat Veteran
  • *****
  • Posts: 664
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: Converting scanned text files, help needed
« Reply #1 on: Monday 04 February 13 16:59 GMT (UK) »
Hi Ainslie,
              Not long ago I rediscovered PDFCreator which has an adjunct program, PDF Architect, which will make composites from single PDF pages. Very long established, good and freeware. :D

Alan
Glamorgan - Carpenter, Chamberlain, Ellis, Watkins, Rees, Bevan
Wiltshire - Carpenter, Chamberlain, Ellis, Merrett
Essex - Burdon, Taylor, Menzies
Canada - Burdon, Parkinson
Australia - Carpenter, Burdon

Offline ITBookworm

  • RootsChat Member
  • ***
  • Posts: 208
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: Converting scanned text files, help needed
« Reply #2 on: Monday 04 February 13 17:07 GMT (UK) »
Using Alan's program you should be able to get a PDF file with multiple pages without any real problem BUT (there always is a but  :) :) ) I am not sure if you will be able to edit the result.

What you need is to scan with OCR (optical character recognition) which will convert the characters on the paper into characters in the computer file rather than convert the characters on paper to a 'computer picture' of characters on paper.

I know there is free scanner software that will do that and it is possible that your own printer scanner software has that option as well. I have tried it before but not recently so not best placed to advise on that bit.

Good luck.
Dempster, Harvie, Comrie, Adams
O'Neill, Curry, Dunbar, Crichton

Offline alanmack

  • RootsChat Veteran
  • *****
  • Posts: 664
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: Converting scanned text files, help needed
« Reply #3 on: Monday 04 February 13 17:10 GMT (UK) »
ITB,
      I'd tend to agree with everything you say! :)

Alan
Glamorgan - Carpenter, Chamberlain, Ellis, Watkins, Rees, Bevan
Wiltshire - Carpenter, Chamberlain, Ellis, Merrett
Essex - Burdon, Taylor, Menzies
Canada - Burdon, Parkinson
Australia - Carpenter, Burdon


Offline kevinf2349

  • RootsChat Senior
  • ****
  • Posts: 446
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: Converting scanned text files, help needed
« Reply #4 on: Monday 04 February 13 18:04 GMT (UK) »
I seem to recall that there is an online facility that will convert a PDF back into a word document.

I remember using it for a PDF I had and it did OK except for diagrams which it had some issues with in terms of page layout. As I recall it doesn't handle more that a certain number of pages (but I think it was in the 100's).

I will see if I can find the URL for you.   http://www.pdftoword.com/

Kevin
Ferguson, Stockton-on-Tees
Hollinshead, Stafford/Guisborough
Pratt, Berwick/Newcastle-upon-Tyne
McDonald, Teesdale
Charlton, Hexham
Carlyle, Hexham/Annan Dumfries

Offline confusion

  • RootsChat Senior
  • ****
  • Posts: 307
  • I was born poor - and still have all of it
    • View Profile
Re: Converting scanned text files, help needed
« Reply #5 on: Monday 04 February 13 18:28 GMT (UK) »
ainslie:

try www.zamzar.com free online file convertor

The vagaries of converting pdf to text documents can sometimes be disappointing.
You may need to re-edit the data. This is due to the difference in the file formats.

As an example, I send text documents as pdf files only, as they cannot be edited/changed
easily.

Hope this helps.

Jim
Willey, Berry, Cox, Davis, Haddock, Hutton, Griffiths/Griffin, Tanner - Worcestershire
Cox, Dudley, Harris, Moore, Neville, Payne - Warwickshire
Chambers, Douds, Dryden, Given, Hamilton, Hassan, McPherson, McWhirter, Simpson, Taggart, Vauls, Whiteside - Ireland/Scotland, Northumberland
Challis, Halls, Heady, Grove, Lawrence - Essex
Foxwell, Imm, Ward - Gloucesteshire
Heady, Collis, Griffin - Hertfordshire
Hurling - Middlesex
Willey, Imm - Monmouthshire
Imm, Hamilton, Hedge, Majury, Sollis - US

Offline ainslie

  • RootsChat Aristocrat
  • ******
  • Posts: 2,768
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: Converting scanned text files, help needed
« Reply #6 on: Monday 04 February 13 18:35 GMT (UK) »
Thanks for those clues, something to keep me busy tomorrow.
A