Mercurial > hg > anteater
diff training/dataset_desc_applicant.txt @ 0:036535fcd179
anteater
author | jdamerow |
---|---|
date | Fri, 14 Sep 2012 10:30:43 +0200 |
parents | |
children |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/training/dataset_desc_applicant.txt Fri Sep 14 10:30:43 2012 +0200 @@ -0,0 +1,11 @@ +Dataset description for applicant extraction +============================================ + +iss = in same sentence +1=true, 0=false +float found by: 1 exact, 0 not at all +calculate by: 1 - (common_charcaters/all_characters_of_found x common_charcaters/all_characters_of_gnrd/placemaker) +char_appl_to_name: characters to beginning of word applicant (minus or plus) +---------------------- + +text_type(1=Sum,2=SInf) name_length issued_iss applied_iss permit_iss comment_iss isSubject applicant_iss char_applicant_to_name Person/Org/Location(1,2,3) found_by_GNRD(float) start_idx_equals_placemaker_start_idx found_by_Placemaker(float) start_idx_equals_placemaker_start_idx surrounded_by_brackets surrounded_by_commas followed_by_'s isAbbreviation \ No newline at end of file