Data Integration
Integration of Cologne Data
general
convertFaust converts faust-xml output into FMPXMLRESULT.
match_faust_arb contains the methods for matching the different object with the ARB data.
texts
First step is matching the texts.
- convert text objects from faust to filemaker format.
- import the texts into filemaker database faust-texte.
- export relevant field for matching in this case only the number part of signatur is comparted with the numbers in Title in N_source_codex
- match with match_faust_arb.doMatchTexts
- import results into C_faust_arb_texte
- manually check (layout -> C_faust_arb_texte)
Relating the matched texts:
- Layout N_source_codex_faust displayes links to the matched texts.
Importing the new texts/ all non matching texts:
- Import all faust-ids into N_source_codex, for each is a new empty data set created, original ID is saved in import_from_faust
- Script import_from_faust_text, imports selected fields from the faust-db into ARB
- Script import_all_from_faust_text, loops through all selected datasets and does the import, if possible.
Matching:
recipes
- Matching of recipies has to be done
- Import non matching recipes (Import all recipies which can't be in ARB because the texts are not in ARB. (Seems to be only one ID:4229)
- Import remaining recipes
- first: import von faust-recepte: kommentar -> remark, key -->
__import_key__from_cologne
, Material --> material - second : import now : Folio_Seite --> location, c_faust_arb_texte:correct_arb --> ID_target, N_recipies:ID --> ID_source
- Now export the now generated datasets (fields: ID_source, ID)
- Now import the exported data into N_recipes, with ID_soruce matching ID und ID -> ID_recipes_sources
- Now import transcription and translation:
- Export transcription and Übersetzung mit ID_recep_
- Import in N_representation transkription --> transcription, Übersetzung --> translation, C:recipes_soruces:ID --> _tmp
- Export the new datasets from there _tmp,ID
- Import this into C_recipes_source, witch matching _tmp -> ID_source, ID -> ID_representation
- Export from faust-lexikon.fp7 N_ingredient, N_recipes and key
- Generate from the export file with splitExport mapping for C_recipes_ingredient.
- Import this into C_recipes_ingredient
- Import in N_appellation from faust-lexikon, key -> import_from_cologne, name -> appelation
- Export from N_appellation
__import__from_cologne_ID
and ID - Import into C_recipes_ingredient matching
__import__from_cologne_ID
ID --> ID_appellation - Import in N_lemma from faust-lexikon, key -> import_from_cologne, lemma -> name
- Export from N_Lemma
__import__from_cologne_ID
and ID - Import into C_recipes_ingredient matching
__import__from_cologne_ID
ID --> ID_lemma - Results need checking Lemmas and Appellation
- first: import von faust-recepte: kommentar -> remark, key -->
lexikon
- General: solves problem material vs. ingredient.
- First step: matching DB_glossary and ingredients. (inported all missing, and comments), should be checked by Sylvie
- Import all Moderne Namen ";" separated into to entries in N_ingredient (datei: moderneNamen_id.csv), new field lexikon_id_cologne, this ID was created by a script (modernNameMatch) and is not in the FAUST db.
Last modified 10 years ago
Last modified on Dec 9, 2013, 11:50:22 AM