Changes between Version 1 and Version 2 of normalize_arabic_translit


Ignore:
Timestamp:
May 7, 2015, 4:05:55 PM (9 years ago)
Author:
casties
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • normalize_arabic_translit

    v1 v2  
    66Currently: source:OpenMind/src/main/java/org/mpi/openmind/repository/utils/NormalizerUtils.java
    77
     8=== 1. replace letter combinations ===
     9
     10Replace the following letter combinations with a single letter.
     11
     12|| th || t ||
     13|| kh || h ||
     14|| dh || d ||
     15|| sh || s ||
     16|| gh || g ||
     17|| "aẗ ", "at " || "a " ||
     18|| ỳ || a ||
     19
     20=== 2. replace letters with diacritics ===
     21
     22Replace all letters with diacritics with the letter without diacritics.
     23
     24Remove all apostrophes.