agr2ace

The agr2ace Perl script is a utility that helps convert Agricola dump files to .ace files. It was created for the Arabidopsis ACE database and may require customization depending on your ACEDB models. The program prints its output to standard output, so the output must be redirected to a file in order to create the .ace file. Command-line usage:

agr2ace.pl Agricola_dump_file > Agricola.ace

In addition to the captured output, the script writes two additional files which may be helpful when creating .ace files for your database. The first, "journal.names", lists the journal names; the second, "genbank.acc", lists the GenBank accessions.

The references are labelled using the convention of first author + year + a 5-character accession label + #_#number. By default the accession label begins with "aaaaa". If the script is run with the "-start=xxxxx" option, labelling begins with xxxxx, and the next reference receives the accession label xxxxy.
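
The way the accession label advances from one reference to the next can be illustrated with a short sketch. This is written in Python for illustration only and is not taken from the agr2ace Perl source; the function name next_label is hypothetical. The step from xxxxx to xxxxy comes from the description above, while the carry behaviour after "z" (for example aaaaz to aaaba) is an assumption, since this document does not say what the script does at that point.

```python
import string

ALPHABET = string.ascii_lowercase  # 'a' through 'z'


def next_label(label: str) -> str:
    """Return the label following `label`, treating it as a base-26 counter.

    Hypothetical helper: only the xxxxx -> xxxxy step is documented;
    the carry on 'z' is an assumption.
    """
    chars = list(label)
    i = len(chars) - 1
    while i >= 0:
        pos = ALPHABET.index(chars[i])  # raises ValueError on other characters
        if pos < len(ALPHABET) - 1:
            chars[i] = ALPHABET[pos + 1]
            return "".join(chars)
        # Assumed behaviour: 'z' wraps to 'a' and the carry moves one place left.
        chars[i] = ALPHABET[0]
        i -= 1
    raise OverflowError("no label follows " + label)


print(next_label("aaaaa"))  # aaaab
print(next_label("xxxxx"))  # xxxxy
```

Under this assumption, five lowercase letters allow 26^5 = 11,881,376 distinct labels before the sequence is exhausted.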

The "#_#number" is a temporary extension that makes it easy to pick out the new references from those already in the database when looking at a keyset of references. This is one way to look for duplicates. Once the references have been checked, the "#_#number" should be removed from each name before the references are finally committed to the database.

Comments to: John Morris