| 1 | = Data Integration = |
| 2 | |
| 3 | |
| 4 | == Integration of Cologne Data == |
| 5 | |
| 6 | === general === |
| 7 | |
| 8 | '''convertFaust''' converts faust-xml output into FMPXMLRESULT. |
| 9 | |
| 10 | '''match_faust_arb''' contains the methods for matching the different objects with the ARB data. |
| 11 | |
| 12 | === texts === |
| 13 | The first step is matching the texts. |
| 14 | |
| 15 | * Convert the text objects from faust to FileMaker format. |
| 16 | * Import the texts into the FileMaker database faust-texte. |
| 17 | * Export the fields relevant for matching. In this case, only the numeric part of '''signatur''' is compared with the numbers in '''Title''' in N_source_codex. |
| 18 | * Match with '''match_faust_arb.doMatchTexts'''. |
| 19 | * Import the results into C_faust_arb_texte. |
| 20 | * Check manually (layout -> C_faust_arb_texte). |
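The number comparison in the matching step can be illustrated with a minimal sketch. This is not the code of '''match_faust_arb.doMatchTexts'''; the function names, the row layout (ID plus one text field) and the rule "any shared number counts as a candidate" are assumptions made for illustration only.

```python
import re
from collections import defaultdict


def numbers_in(value):
    """Return all digit runs in a string as integers (leading zeros are ignored)."""
    return [int(n) for n in re.findall(r"\d+", value)]


def match_texts(faust_rows, arb_rows):
    """Pair faust texts with ARB codex entries that share a number.

    faust_rows: iterable of (faust_id, signatur)   -- hypothetical export layout
    arb_rows:   iterable of (arb_id, title)        -- hypothetical export layout
    Returns a list of (faust_id, arb_id) candidate pairs for manual checking.
    """
    # Index the ARB entries by every number that occurs in their title.
    by_number = defaultdict(list)
    for arb_id, title in arb_rows:
        for n in set(numbers_in(title)):
            by_number[n].append(arb_id)

    pairs = []
    for faust_id, signatur in faust_rows:
        seen = set()  # avoid reporting the same pair twice
        for n in numbers_in(signatur):
            for arb_id in by_number.get(n, []):
                if arb_id not in seen:
                    seen.add(arb_id)
                    pairs.append((faust_id, arb_id))
    return pairs
```

Because a shared number alone is a weak criterion, the output of such a match is only a list of candidates, which is why the manual check in C_faust_arb_texte remains necessary.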
| 21 | |
| 22 | |
| 23 | Relating the matched texts: |
| 24 | |
| 25 | * The layout N_source_codex_faust displays links to the matched texts. |
| 26 | |
| 27 | Importing the new texts (all non-matching texts): |
| 28 | |
| 29 | * Import all faust IDs into N_source_codex. A new empty data set is created for each ID, and the original ID is saved in {{{__import_from_faust}}}. |
| 30 | * The script import_from_faust_text imports selected fields from the faust DB into ARB. |
| 31 | * The script import_all_from_faust_text loops through all selected data sets and runs the import where possible. |
| 32 | |
| 33 | Matching: |
| 34 | |
| 35 | * [ticket:25 Contents / Remarks of the matching docs have to be integrated?] |
| 36 | * [ticket:26 needs a layout to do so] |
| 37 | |
| 38 | |
| 39 | |
| 40 | |
| 41 | === recipes === |
| 42 | |
| 43 | * [ticket:21 Matching of recipes has to be done] |
| 44 | * [ticket:18 Import non-matching recipes] (import all recipes which can't be in ARB because the texts are not in ARB; seems to be only one, ID:4229) |
| 45 | * [ticket:22 Import remaining recipes] |
| 46 | * First: import from faust-recepte: kommentar --> remark, key --> {{{__import_key__from_cologne}}}, Material --> material |
| 47 | * Second: import Folio_Seite --> location, c_faust_arb_texte:correct_arb --> ID_target, N_recipies:ID --> ID_source |
| 48 | * Export the newly generated data sets (fields: ID_source, ID) |
| 49 | * Import the exported data into N_recipes, matching ID_source with ID, and ID --> ID_recipes_sources |
| 50 | * Now import transcription and translation: |
| 51 | * Export transcription and Übersetzung with ID_recep_ |
| 52 | * Import into N_representation: transkription --> transcription, Übersetzung --> translation, C:recipes_soruces:ID --> _tmp |
| 53 | * Export the new data sets from there (fields: _tmp, ID) |
| 54 | * Import this into C_recipes_source, matching _tmp --> ID_source, ID --> ID_representation |
| 55 | * Export from faust-lexikon.fp7 N_ingredient, N_recipes and key |
| 56 | * Use splitExport to generate the mapping for C_recipes_ingredient from the export file. |
| 57 | * Import this into C_recipes_ingredient |
| 58 | * Import into N_appellation from faust-lexikon: key --> {{{__import_from_cologne}}}, name --> appelation |
| 59 | * Export {{{ __import__from_cologne_ID }}} and ID from N_appellation |
| 60 | * Import into C_recipes_ingredient, matching on {{{ __import__from_cologne_ID }}}: ID --> ID_appellation |
| 61 | * Import into N_lemma from faust-lexikon: key --> {{{__import_from_cologne}}}, lemma --> name |
| 62 | * Export {{{ __import__from_cologne_ID }}} and ID from N_lemma |
| 63 | * Import into C_recipes_ingredient, matching on {{{ __import__from_cologne_ID }}}: ID --> ID_lemma |
| 64 | * Results need checking [ticket:28 Lemmas] and [ticket:29 Appellation] |
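Several of the steps above follow the same pattern: import records while keeping the Cologne key, export the key together with the newly assigned ID, then use that pair to fill a foreign-key field in a link table. The following sketch shows that pattern outside FileMaker. The function name and the dict-based row layout are hypothetical and not part of the actual scripts.

```python
def remap_foreign_keys(link_rows, exported_ids, key_field, target_field):
    """Fill target_field in each link row by looking up its Cologne key.

    link_rows:    list of dicts, e.g. rows destined for C_recipes_ingredient
    exported_ids: iterable of (cologne_key, new_id) pairs exported after the import
    key_field:    name of the field in link_rows that holds the Cologne key
    target_field: name of the field to fill, e.g. "ID_lemma"
    Returns the rows whose key had no new ID, so they can be checked manually.
    """
    new_id_by_key = dict(exported_ids)
    unmatched = []
    for row in link_rows:
        new_id = new_id_by_key.get(row[key_field])
        if new_id is None:
            unmatched.append(row)
        else:
            row[target_field] = new_id
    return unmatched
```

Returning the unmatched rows instead of dropping them silently mirrors the need to check the results by hand.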
| 65 | |
| 66 | |
| 67 | |
| 68 | === lexikon === |
| 69 | |
| 70 | * [ticket:24 General: solves problem material vs. ingredient.] |
| 71 | * First step: matching DB_glossary and ingredients (imported all missing entries and comments); [ticket:23 should be checked by Sylvie] |
| 72 | * Import all Moderne Namen, split at ";", into separate entries in N_ingredient (file: moderneNamen_id.csv). The new field lexikon_id_cologne holds an ID that was created by a script (modernNameMatch) and is not in the FAUST DB. |
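Splitting the ";"-separated Moderne Namen into one entry per name could look roughly like the sketch below. It is not the modernNameMatch script; the function name and the assumed input of (ID, names) pairs are hypothetical, since the column layout of moderneNamen_id.csv is not documented here.

```python
def split_modern_names(rows, separator=";"):
    """Turn (lexikon_id, "name1; name2") rows into one (lexikon_id, name) per name.

    Whitespace around names is trimmed and empty fragments are skipped,
    so a trailing separator does not produce an empty entry.
    """
    entries = []
    for lexikon_id, names in rows:
        for name in names.split(separator):
            name = name.strip()
            if name:
                entries.append((lexikon_id, name))
    return entries
```

Each resulting pair would then become one data set in N_ingredient, with the ID stored in lexikon_id_cologne.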
| 73 | |
| 74 | |
| 75 | |
| 76 | |