Changes between Version 7 and Version 8 of normalize_arabic_translit


Ignore:
Timestamp:
May 8, 2015, 3:58:51 PM (9 years ago)
Author:
casties
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • normalize_arabic_translit

    v7 v8  
    1 
    2 
    3 == Normalizing arabic transliterations ==
     1= Normalizing arabic transliterations =
    42
    53Algorithm for normalizing the existing transliterated arabic (_translit fields) in the database.
    64
    7 === New ===
     5== New ==
    86
    9 ==== 1. replace letter combinations ====
     7=== 1. replace letter combinations ===
    108
    119Replace the following letter combinations with a single letter:
     
    2725(replace all y and move to 3.?)
    2826
    29 ==== 2. remove diacritics ====
     27=== 2. remove diacritics ===
    3028
    3129Replace all letters with diacritics with the letter without diacritics.
    3230
    33 ==== 3. replace letters ====
     31=== 3. replace letters ===
    3432
    3533Replace the following letters to unify the searches:
     
    3735|| g, j || j ||
    3836
    39 ==== Questions ====
     37=== Questions ===
    4038
    4139What about apostrophes/accents? Normalize to single-quote (U+2019) or apostrophe (U+0027)?
     
    4745
    4846
    49 === Currently ===
     47== Currently ==
    5048
    5149source:OpenMind/src/main/java/org/mpi/openmind/repository/utils/NormalizerUtils.java
    5250
    53 ==== 1. replace letter combinations ====
     51=== 1. replace letter combinations ===
    5452
    5553Replace the following letter combinations with a single letter.
     
    6361|| ỳ || a ||
    6462
    65 ==== 2. replace letters with diacritics ====
     63=== 2. replace letters with diacritics ===
    6664
    6765Replace all(?) letters with diacritics with the letter without diacritics.