wiki:dataIntegration

Data Integration

Integration of Cologne Data

general

convertFaust converts faust-xml output into FMPXMLRESULT.

match_faust_arb contains the methods for matching the different object with the ARB data.

texts

First step is matching the texts.

  • convert text objects from faust to filemaker format.
  • import the texts into filemaker database faust-texte.
  • export relevant field for matching in this case only the number part of signatur is comparted with the numbers in Title in N_source_codex
  • match with match_faust_arb.doMatchTexts
  • import results into C_faust_arb_texte
  • manually check (layout -> C_faust_arb_texte)

Relating the matched texts:

  • Layout N_source_codex_faust displayes links to the matched texts.

Importing the new texts/ all non matching texts:

  • Import all faust-ids into N_source_codex, for each is a new empty data set created, original ID is saved in import_from_faust
  • Script import_from_faust_text, imports selected fields from the faust-db into ARB
  • Script import_all_from_faust_text, loops through all selected datasets and does the import, if possible.

Matching:

recipes

  • Matching of recipies has to be done
  • Import non matching recipes (Import all recipies which can't be in ARB because the texts are not in ARB. (Seems to be only one ID:4229)
  • Import remaining recipes
    • first: import von faust-recepte: kommentar -> remark, key --> __import_key__from_cologne, Material --> material
    • second : import now : Folio_Seite --> location, c_faust_arb_texte:correct_arb --> ID_target, N_recipies:ID --> ID_source
    • Now export the now generated datasets (fields: ID_source, ID)
    • Now import the exported data into N_recipes, with ID_soruce matching ID und ID -> ID_recipes_sources
      • Now import transcription and translation:
      • Export transcription and Übersetzung mit ID_recep_
      • Import in N_representation transkription --> transcription, Übersetzung --> translation, C:recipes_soruces:ID --> _tmp
      • Export the new datasets from there _tmp,ID
      • Import this into C_recipes_source, witch matching _tmp -> ID_source, ID -> ID_representation
      • Export from faust-lexikon.fp7 N_ingredient, N_recipes and key
      • Generate from the export file with splitExport mapping for C_recipes_ingredient.
      • Import this into C_recipes_ingredient
      • Import in N_appellation from faust-lexikon, key -> import_from_cologne, name -> appelation
      • Export from N_appellation __import__from_cologne_ID and ID
      • Import into C_recipes_ingredient matching __import__from_cologne_ID ID --> ID_appellation
      • Import in N_lemma from faust-lexikon, key -> import_from_cologne, lemma -> name
      • Export from N_Lemma __import__from_cologne_ID and ID
      • Import into C_recipes_ingredient matching __import__from_cologne_ID ID --> ID_lemma
      • Results need checking Lemmas and Appellation

lexikon

Remarks

Last modified 10 years ago Last modified on Dec 9, 2013, 11:50:22 AM