view training/dataset_desc_applicant.txt @ 0:036535fcd179

anteater
author jdamerow
date Fri, 14 Sep 2012 10:30:43 +0200
parents
children
line wrap: on
line source

Dataset description for applicant extraction
============================================

iss = in same sentence
1=true, 0=false
float found by: 1 exact, 0 not at all
calculate by: 1 - (common_charcaters/all_characters_of_found x common_charcaters/all_characters_of_gnrd/placemaker)
char_appl_to_name: characters to beginning of word applicant (minus or plus)
----------------------

text_type(1=Sum,2=SInf) name_length issued_iss applied_iss permit_iss comment_iss isSubject applicant_iss char_applicant_to_name Person/Org/Location(1,2,3) found_by_GNRD(float) start_idx_equals_placemaker_start_idx found_by_Placemaker(float) start_idx_equals_placemaker_start_idx surrounded_by_brackets surrounded_by_commas followed_by_'s isAbbreviation