Changes between Initial Version and Version 1 of dataIntegration


Ignore:
Timestamp:
Dec 6, 2013, 12:43:45 PM (10 years ago)
Author:
dwinter
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • dataIntegration

    v1 v1  
     1= Data Integration =
     2
     3
     4== Integration of Cologne Data ==
     5
     6=== general ===
     7
     8'''convertFaust''' converts faust-xml output into FMPXMLRESULT.
     9
     10''' match_faust_arb''' contains the methods for matching the different object with the ARB data.
     11
     12=== texts ===
     13First step is matching the texts.
     14
     15   * convert text objects from faust to filemaker format.
     16   * import the texts into filemaker database faust-texte.
     17   * export relevant field for matching in this case only the number part of '''signatur''' is comparted with the numbers in '''Title''' in N_source_codex
     18   * match with '''match_faust_arb.doMatchTexts'''
     19   * import results into C_faust_arb_texte
     20   * manually check (layout -> C_faust_arb_texte)
     21
     22
     23Relating the matched texts:
     24
     25   * Layout N_source_codex_faust displayes links to the matched texts.
     26   
     27Importing the new texts/ all non matching texts:
     28   
     29   * Import all faust-ids into N_source_codex, for each is a new empty data set created, original ID is saved in __import_from_faust
     30   * Script import_from_faust_text, imports selected fields from the faust-db into ARB
     31   * Script import_all_from_faust_text, loops through all selected datasets and does the import, if possible.
     32 
     33Matching:
     34
     35   * [ticket:25 Contents / Remarks of the matching docs have to be integrated?]
     36      * [ticket:26 needs an layout to do so]
     37
     38
     39
     40
     41=== recipes ===
     42
     43   * [ticket:21 Matching of recipies has to be done]
     44   * [ticket:18 Import non matching recipes ] (Import all recipies which can't be in ARB because the texts are not in ARB. (Seems to be only one ID:4229)
     45   * [ticket:22 Import remaining recipes]
     46      * first: import von faust-recepte: kommentar -> remark, key --> {{{__import_key__from_cologne}}}, Material --> material
     47      * second : import now : Folio_Seite --> location, c_faust_arb_texte:correct_arb --> ID_target, N_recipies:ID --> ID_source
     48      * Now export the now generated datasets (fields: ID_source, ID)
     49      * Now import the exported data into N_recipes, with ID_soruce matching ID und ID -> ID_recipes_sources
     50       * Now import transcription and translation:
     51       * Export transcription and Übersetzung mit ID_recep_
     52       * Import in N_representation transkription --> transcription, Übersetzung --> translation, C:recipes_soruces:ID --> _tmp
     53       * Export the new datasets from there _tmp,ID
     54       * Import this into C_recipes_source, witch matching _tmp -> ID_source, ID -> ID_representation
     55       * Export from faust-lexikon.fp7 N_ingredient, N_recipes and key
     56       * Generate from the export file with splitExport mapping for C_recipes_ingredient.
     57       * Import this into C_recipes_ingredient
     58       * Import in N_appellation from faust-lexikon, key -> __import_from_cologne, name -> appelation
     59       * Export from N_appellation {{{ __import__from_cologne_ID }}} and ID
     60       * Import into C_recipes_ingredient matching {{{ __import__from_cologne_ID}}} ID --> ID_appellation
     61       * Import in N_lemma from faust-lexikon, key -> __import_from_cologne, lemma -> name
     62       * Export from N_Lemma {{{ __import__from_cologne_ID }}} and ID
     63       * Import into C_recipes_ingredient matching {{{ __import__from_cologne_ID}}} ID --> ID_lemma
     64       * Results need checking [ticket:28 Lemmas] and [ticket:29 Appellation]
     65
     66
     67
     68=== lexikon ===
     69   
     70   * [ticket:24 General: solves problem material vs. ingredient.]
     71   * First step: matching DB_glossary and ingredients. (inported all missing, and comments), [ticket:23 should be checked by Sylvie]
     72   * Import all Moderne Namen ";" separated into to entries in N_ingredient (datei: moderneNamen_id.csv), new field lexikon_id_cologne, this ID was created by a script (modernNameMatch) and is not in the FAUST db.
     73
     74   
     75
     76