view training/dataset_desc_applicant.txt @ 8:05b2ad41e8f0

linnaeus
author jdamerow
date Fri, 09 Nov 2012 16:12:01 -0700
parents 036535fcd179

Dataset description for applicant extraction
============================================

iss = in same sentence
1=true, 0=false
found_by (float): 1 = exact, 0 = not at all
calculated by: 1 - ((common_characters / all_characters_of_found) x (common_characters / all_characters_of_gnrd_or_placemaker))
char_applicant_to_name: number of characters to the beginning of the word "applicant" (negative or positive)
----------------------
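
The found_by formula in the legend can be sketched in Python as follows. This is only an illustration under stated assumptions: the description does not say how "common characters" are counted, so the sketch counts the characters in the matching blocks reported by `difflib.SequenceMatcher`, and all function names are hypothetical. Note that the formula as written yields 0 for an exact match, while the legend states that 1 means exact, so the sketch exposes both the overlap product and the "1 - overlap" value; which of the two the dataset actually stores should be checked against the data.

```python
from difflib import SequenceMatcher


def common_characters(found: str, reference: str) -> int:
    """Count characters shared by both strings.

    Assumption: "common characters" means the total size of the matching
    blocks found by SequenceMatcher; the dataset description does not
    define the counting method.
    """
    matcher = SequenceMatcher(None, found, reference, autojunk=False)
    return sum(block.size for block in matcher.get_matching_blocks())


def overlap(found: str, reference: str) -> float:
    """(common / len(found)) x (common / len(reference)); 0.0 if either is empty."""
    if not found or not reference:
        return 0.0
    common = common_characters(found, reference)
    return (common / len(found)) * (common / len(reference))


def one_minus_overlap(found: str, reference: str) -> float:
    """The formula exactly as written in the legend: 1 - overlap."""
    return 1.0 - overlap(found, reference)
```

Here `found` stands for the name extracted from the text and `reference` for the string returned by GNRD or Placemaker. For two identical strings `overlap` is 1 and `one_minus_overlap` is 0; for strings with no characters in common the values are reversed.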

text_type(1=Sum,2=SInf) name_length issued_iss applied_iss permit_iss comment_iss isSubject applicant_iss char_applicant_to_name Person/Org/Location(1,2,3) found_by_GNRD(float) start_idx_equals_placemaker_start_idx found_by_Placemaker(float) start_idx_equals_placemaker_start_idx surrounded_by_brackets surrounded_by_commas followed_by_'s isAbbreviation
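
A row laid out in the column order above could be read into a dictionary roughly as follows. This is a sketch under assumptions the description does not confirm: rows are taken to be whitespace-separated, only the two found_by columns are parsed as floats, and the example row is made up for illustration. Because the column name start_idx_equals_placemaker_start_idx appears twice in the list, the hypothetical helper `unique_names` appends a running number to repeated names so that neither value is lost.

```python
from collections import Counter

# Column names in the order given in the description (value hints removed).
COLUMNS = [
    "text_type", "name_length", "issued_iss", "applied_iss", "permit_iss",
    "comment_iss", "isSubject", "applicant_iss", "char_applicant_to_name",
    "Person/Org/Location", "found_by_GNRD",
    "start_idx_equals_placemaker_start_idx", "found_by_Placemaker",
    "start_idx_equals_placemaker_start_idx", "surrounded_by_brackets",
    "surrounded_by_commas", "followed_by_'s", "isAbbreviation",
]
FLOAT_COLUMNS = {"found_by_GNRD", "found_by_Placemaker"}


def unique_names(names):
    """Append _1, _2, ... to names that occur more than once."""
    totals = Counter(names)
    seen = Counter()
    result = []
    for name in names:
        if totals[name] > 1:
            seen[name] += 1
            result.append(f"{name}_{seen[name]}")
        else:
            result.append(name)
    return result


def parse_row(line):
    """Parse one whitespace-separated row (assumed format) into a dict."""
    fields = line.split()
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    row = {}
    for key, name, raw in zip(unique_names(COLUMNS), COLUMNS, fields):
        row[key] = float(raw) if name in FLOAT_COLUMNS else int(raw)
    return row


# Made-up row for illustration only; not taken from the dataset.
example = parse_row("1 12 0 1 0 0 1 1 -25 1 0.0 0 1.0 1 0 0 0 0")
```

With this layout, `example["char_applicant_to_name"]` holds the signed character offset and the two start-index flags are available under the keys ending in `_1` and `_2`.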