How to generate a Gramps xml file outside of Gramps?

I made a PR for that: “CSV: possibility to select the dialect” by SNoiraud (Pull Request #1314, gramps-project/gramps on GitHub).

I got an earlier version of your answer in my mail, and I agree. In research, you want to be able to create relations between all sorts of objects without restrictions, and preferably right from the database, and not from an export.

This means that, in an ideal world, it would be more interesting to have a more open format inside the database, and not just in Gramps XML.

That takes care of outbound and will be very helpful for porting data outside. Thank you!

But neither the Import CSV nor the Import Text gramplet allows bringing in data with delimiters other than commas. (A simple copy/paste of cells from Excel comes into the Import Text gramplet as tab-delimited.)
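
In the meantime, there is a workaround outside Gramps: convert the tab-delimited text to comma-delimited CSV before importing. A minimal sketch using Python’s csv module, with placeholder file names:

```python
# Minimal sketch: convert tab-delimited text (e.g. pasted from Excel and saved
# to a file) into comma-delimited CSV that the Gramps CSV importer accepts.
# The file names are placeholders.
import csv

def convert_to_comma_csv(src_path, dst_path):
    with open(src_path, newline="", encoding="utf-8") as src:
        sample = src.read(4096)
        src.seek(0)
        # Let the csv module guess the delimiter (tab, semicolon, comma, ...).
        dialect = csv.Sniffer().sniff(sample, delimiters="\t;,")
        rows = list(csv.reader(src, dialect))
    with open(dst_path, "w", newline="", encoding="utf-8") as dst:
        csv.writer(dst).writerows(rows)

convert_to_comma_csv("pasted_from_excel.txt", "for_gramps_import.csv")
```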

I added this feature to importcsv. I tested it and all places are created; you don’t need to recreate the hierarchy.

That’s great!

Sorry to have to ask instead of trying it myself. My hotspot is offline until the cellular data plan resets in a couple of days. I can read but not download!

I just tried something different. When running the export, I used a custom person filter “Nobody” (which returns values that do not match the filter rule “Everyone”). The output seems to include all objects other than people or families (i.e., places, repositories, sources, and citations, but also events and media). I’m not sure what I would ever use it for, but I found it interesting.

I see that one can change the order of the filtering in the Export Options dialog (by using the “Change order” button) but I haven’t tried that yet. Maybe that would help?

Unfortunately not. It is noted at the end of the bug report (in the Additional Notes section) that the test was repeated with the People rule first (the default) and last, with no difference.

If you want a new database, you can populate the Places and Sources/Repository without having to start from scratch again.

Sometimes sites have information that gets lost when it’s converted to standard GEDCOM, and this may be the case with FamilySearch, which uses the richer GedcomX to communicate with client programs like getmyancestors, and also with programs like Ancestral Quest and RootsMagic, which I use to download parts of the shared family tree.

I do agree that modifying getmyancestors to write Gramps XML is probably overkill, because not much gets lost in standard GEDCOM. The only thing that I can think of right now is the name type, and for me that wouldn’t be worth it, because the shared tree is too messy for that. But even if you think that part is important, it may be easier to add a custom _TYPE tag to the getmyancestors output, and make sure that Gramps can read that. The amount of code involved in such a construct is way smaller than adding full support for Gramps XML.
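
For illustration, a minimal sketch of what such a custom subtag could look like in a GEDCOM writer. The _TYPE tag is the custom tag mentioned above; the helper function and the sample data are mine, not part of getmyancestors, and an importer would still have to be taught to map the tag to a Gramps name type.

```python
# Hedged sketch: emit a GEDCOM personal name with a custom subtag carrying the
# name type. The record follows the GEDCOM level-number convention; the _TYPE
# tag and the sample data are illustrative only, not actual getmyancestors output.
def gedcom_name_lines(given, surname, name_type=None):
    lines = ["1 NAME %s /%s/" % (given, surname)]
    if name_type:
        # Custom (underscore-prefixed) tag that an importer could map to a
        # Gramps name type if it is taught to recognise it.
        lines.append("2 _TYPE %s" % name_type)
    return lines

print("\n".join(["0 @I1@ INDI"] + gedcom_name_lines("Marie", "Legoux", "Birth Name")))
```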

Please note that GEDCOM itself is not as inferior as some may think, because basically it’s just a format, like JSON and XML, meaning that it’s a way to store objects with nested contents and relations between them. This means that with a couple of custom tags, we could make a GEDCOM export that includes all the information that we have in the Gramps database, or in Gramps XML. In fact, this is exactly what RootsMagic does, in a very nice way. According to Randy Seaver, it’s the only program that can read its own GEDCOM without losing anything, and I know that it has full support for citation templates, in GEDCOM format.
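
To make the “GEDCOM is just a nested format” point concrete, here is a toy sketch that turns GEDCOM level numbers into a tree, much like a JSON or XML parser would. The sample record and the node layout are purely illustrative.

```python
# Minimal sketch: GEDCOM's level numbers encode the same kind of tree that
# JSON or XML encode with braces or nested elements. This toy parser turns
# lines into {tag, value, children} nodes; the sample record is illustrative.
def parse_gedcom(lines):
    root = {"tag": "ROOT", "value": "", "children": []}
    stack = [root]  # stack[level] is the current node at that depth
    for line in lines:
        level_str, tag, *rest = line.split(" ", 2)
        level = int(level_str)
        node = {"tag": tag, "value": rest[0] if rest else "", "children": []}
        del stack[level + 1:]                 # drop parents deeper than this line
        stack[level]["children"].append(node) # attach to the right parent
        stack.append(node)
    return root

record = parse_gedcom([
    "0 @I1@ INDI",
    "1 NAME Marie /Legoux/",
    "2 _TYPE Birth Name",
])
print(record["children"][0]["children"][0])   # the NAME node with its _TYPE child
```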

This does not mean that I’m against creating Gramps XML. I’d love to see a web scraping program that creates Gramps XML extracts from my favorite sites, just like @PLegoux suggested. But that also means that we probably need a smarter import too, to avoid duplicate locations and all that.
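
As a rough idea of what “smarter” could mean for places, here is a hedged sketch that normalises incoming place names and reuses an existing place when one matches, instead of creating a duplicate. The existing_places table, the handle values, and the normalisation rule are all assumptions for illustration.

```python
# Hedged sketch: avoid duplicate places on import by normalising names and
# keeping an index of places already in the tree. The existing_places data
# and the normalisation rule are illustrative assumptions.
import unicodedata

def normalize(name):
    # Case-fold and strip accents so "Brugge " and "brugge" collapse together.
    folded = unicodedata.normalize("NFKD", name).encode("ascii", "ignore").decode()
    return folded.strip().lower()

existing_places = {"Brugge": "P0001", "Paris": "P0002"}       # name -> handle
index = {normalize(n): h for n, h in existing_places.items()}

def place_handle_for(name, new_handle_factory=lambda: "P_NEW"):
    key = normalize(name)
    if key in index:
        return index[key]            # reuse the existing place
    handle = new_handle_factory()    # otherwise create a new one (sketched)
    index[key] = handle
    return handle

print(place_handle_for("brugge"))    # -> P0001, no duplicate created
print(place_handle_for("Gent"))      # -> P_NEW
```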

Yeah, a BeautifulSoup or Selenium add-on for Gramps would be nice. ( html - Use Python to Scrape for Data in Family Search Records - Stack Overflow or Web Scraping using Selenium and Python | ScrapingBee )
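
For what it’s worth, a minimal sketch of the requests + BeautifulSoup approach those links describe. The URL and the CSS selectors are placeholders; a real scraper needs selectors for the specific site and should respect its terms of service.

```python
# Hedged sketch of record scraping with requests + BeautifulSoup. The URL and
# the CSS selectors are placeholders, not taken from any real site.
import requests
from bs4 import BeautifulSoup

def scrape_records(url):
    html = requests.get(url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    records = []
    for row in soup.select("table.results tr"):   # placeholder selector
        cells = [td.get_text(strip=True) for td in row.find_all("td")]
        if cells:
            records.append(cells)
    return records

# for record in scrape_records("https://example.org/search?name=Legoux"):
#     print(record)
```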

But data scraping makes me wonder how (or whether) to integrate the data with your painstakingly curated Tree. In many cases, there is an attraction to doing prospecting trips in your research to collect leads. But while collecting, you’re well aware that most of the leads won’t pan out. You just want to keep them neat and accessible, not fold them into your Tree prematurely.

Note: The name type does exist in GEDCOM 5.5.1, so currently I don’t see much reason to create a Gramps XML writer for getmyancestors.

When I think of web scraping, I don’t necessarily think of Python. That’s partly because I haven’t done any Python in years and have only programmed in C#, but maybe more importantly because it would be nice to have that in a Chrome/Edge or Firefox add-on, which probably means that it’s better to use JavaScript.

For the integration part, I would suggest a specialized piece of XML that stores the data of all participants present at an event, and their roles, just like we already store these data when we use the forms gramplet, but in a more generic way: not using attributes, but a real schema. You can find a possible example of this on this source page, from a Dutch site that happens to have a birth record for a Marie Antoinette Evelina Legoux, who was born in Brugge and died in Paris:

You can view this page in 4 languages, including French and English, but the interesting part is in the page source, close to the bottom. When you look at that, you can see an XML document that has all the necessary source data in it: the persons, their roles, everything about the event, and the source metadata that the site uses to generate a formatted citation. It’s all there, and it’s documented here:
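
As a hedged sketch of what a consumer of that embedded XML could do, the snippet below pulls the A2A block out of a saved page source and lists the persons with their roles. The element and attribute names (Person, pid, RelationEP, PersonKeyRef, RelationType) are my reading of the A2A schema and should be checked against the documentation and the actual page source.

```python
# Hedged sketch: extract an embedded A2A document from a saved page source and
# list persons and roles. Element/attribute names are assumptions to verify
# against the A2A schema; namespaces are stripped for simplicity.
import re
import xml.etree.ElementTree as ET

def local(tag):
    return tag.rsplit("}", 1)[-1]          # drop the namespace prefix

def persons_and_roles(page_source):
    # Grab the first <A2A ...>...</A2A> block embedded verbatim in the HTML.
    match = re.search(r"<A2A[\s>].*?</A2A>", page_source, re.S)
    if not match:
        return []
    root = ET.fromstring(match.group(0))
    names, roles = {}, {}
    for el in root.iter():
        if local(el.tag) == "Person":
            pid = el.get("pid")            # assumed person key attribute
            parts = [t.text.strip() for t in el.iter()
                     if local(t.tag).startswith("PersonName") and t.text and t.text.strip()]
            names[pid] = " ".join(parts)
        elif local(el.tag) == "RelationEP":
            pref = rtype = None
            for child in el:
                if local(child.tag) == "PersonKeyRef":
                    pref = child.text
                elif local(child.tag) == "RelationType":
                    rtype = child.text
            roles[pref] = rtype
    return [(names.get(p, p), r) for p, r in roles.items()]

# with open("record_page.html", encoding="utf-8") as fh:
#     print(persons_and_roles(fh.read()))
```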

A2A is a local standard, used by all archives in The Netherlands, but this particular record shows that it was also adopted by at least one archive in Belgium, and the daughter’s death record shows that it was also adopted by an archive in France.

Note: The site also has a McCullough family emigrating from Rotterdam.

OK, good point, and something that can be addressed, when someone finds time for that. And that leads to questions like:

  1. Would it be worth our (developers’) effort to create an exporter for places that includes all attached notes, sources, etc.? I don’t need that myself, because I work with a single large tree, but it can be a big time saver for users who work with separate ones.

  2. Or could we just as well rely on an external tool, that reads a Gramps backup file, and does this outside Gramps? For me, that would work just as well, and it would allow me to write such a tool in a language that I’m way more proficient in, like C#, and another person might prefer Java for that. And an advanced Python developer could even write an independent tool that reads the pickled data from our database.

My personal view is that there is no real need to create another exporter, because a Gramps XML backup has the full database in it, and any smart person can figure out what to do with it, even without documentation, simply by reading the XML and trying to make sense of it. That’s reverse engineering, and it can work quite well. It’s also what we do when we’re confronted with an exotic GEDCOM file created by a company that has better things to do than to write documentation for competitors.
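
To show how low the barrier is, here is a hedged sketch of an external tool that opens a .gramps backup (gzip-compressed Gramps XML) and lists the place records. The element names (placeobj, pname) are what I recall from the Gramps XML format and should be verified against a real export.

```python
# Hedged sketch: a .gramps backup is usually gzip-compressed Gramps XML, so an
# external tool can read it directly. This lists place names; element names
# and namespace handling should be checked against an actual export.
import gzip
import xml.etree.ElementTree as ET

def local(tag):
    return tag.rsplit("}", 1)[-1]          # ignore the xmlns prefix

def list_places(backup_path):
    with gzip.open(backup_path, "rb") as fh:
        root = ET.parse(fh).getroot()
    places = []
    for el in root.iter():
        if local(el.tag) == "placeobj":
            names = [p.get("value") for p in el if local(p.tag) == "pname"]
            places.append((el.get("id"), names))
    return places

# for place_id, names in list_places("my_tree.gramps"):
#     print(place_id, names)
```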

And in the case of places, one can also think of a selective import instead.
