Author Topic: What's the best software or site to merge datasets with lots of duplicates?  (Read 684 times)

Offline Barly2

  • RootsChat Extra
  • **
  • Posts: 2
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Classical (stupid) problem: I've two versions of a gedcom, both now have unique and important data, but most of it is the same, not just a handful of people.

I tried GenMerge, but there are some issues (which they tell me will be resolved in the upcoming release, but that doesn't help me now).

Any tips for automated merging of duplicates, with the option to nod off or dismiss?

Offline Treetotal

  • RootsChat Marquessate
  • *******
  • Posts: 28,500
    • View Profile
Re: What's the best software or site to merge datasets with lots of duplicates?
« Reply #1 on: Monday 16 December 13 14:56 GMT (UK) »
Hi and welcome to Rootschat...some good advice here:

http://familytreemagazine.com/article/Match-Makers

Carol
CAPES Hull. KIRK  Leeds, Hull. JONES  Wales,  Lancashire. CARROLL Ireland, Lancashire, U.S.A. BROUGHTON Leicester, Goole, Hull BORRILL  Lincolnshire, Durham, Hull. GROOM  Wishbech, Hull. ANTHONY St. John's Nfld. BUCKNALL Lincolnshire, Hull. BUTT Harbour Grace, Newfoundland. PARSONS  Western Bay, Newfoundland. MONAGHAN  Ireland, U.S.A. PERRY Cheshire, Liverpool.
 
RESTORERS:PLEASE DO NOT USE MY RESTORES WITHOUT PRIOR PERMISSION - THANK YOU

Offline Barly2

  • RootsChat Extra
  • **
  • Posts: 2
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: What's the best software or site to merge datasets with lots of duplicates?
« Reply #2 on: Monday 16 December 13 16:03 GMT (UK) »
Thanks for the welcome!

I've tried GenMerge, one of the tools mentioned in the link, but there were some problems - at the moment, it seems to be auto-only, so you can't tell the thing that Sebastian Duke of Deedumbia of the 13th ct isn't the same as Kevin Sebastian, born 1984, once it gets it wrong.

Offline GrahamSimons

  • RootsChat Marquessate
  • *******
  • Posts: 3,146
    • View Profile
Re: What's the best software or site to merge datasets with lots of duplicates?
« Reply #3 on: Monday 16 December 13 19:58 GMT (UK) »
I use Family Historian, and have done a successful fairly-big merge. The help documentation does advise making copies of both files before doing anything else, but I didn't need to revert to the backup.
Simons Barrett Jaffray Waugh Langdale Heugh Meade Garnsey Evans Vazie Mountcure Glascodine Parish Peard Smart Dobbie Sinclair....
in Stirlingshire, Roxburghshire; Bucks; Devon; Somerset; Northumberland; Carmarthenshire; Glamorgan


Offline falcybe

  • RootsChat Veteran
  • *****
  • Posts: 840
  • Census information Crown Copyright, from www.nationalarchives.gov.uk
    • View Profile
Re: What's the best software or site to merge datasets with lots of duplicates?
« Reply #4 on: Saturday 21 December 13 21:50 GMT (UK) »
You can do it in Legacy, too, http://www.legacyfamilytree.com/; my sister has merged lots of trees with this and I have carried out a couple of merges.

Beware though that no programme is fool-proof, you must take care to check each proposed merge yourself

cheers, falcybe
Hayden Cowan Weir Jowett Barclay Howard Gooch Joiner Rayner Ash Travers Coltman Samuel Falconer Lacey Croton Clarke Robinson Alden Burroughs
Ford Lusty Jones Wice Wise Scorey Rayner Harding Bacon Chambers and lots more
Click on the little house on the left to go to our site