Changes between Version 6 and Version 7 of Regex from Alvarus


Ignore:
Timestamp:
May 12, 2009, 1:53:22 PM (15 years ago)
Author:
trzeciok
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Regex from Alvarus

    v6 v7  
    22In each list has the following form: at first a string of regular expressions and characters for a certain set of abbreviations in the raw text and at second a string of regular expressions for the same set containing the abbreviation tagged as attribute and the dissolved abbreviation for the text which is to be postprocessed.
    33
    4 [ List of Regular Expressions to substitute simple/standart abbreviations ]
     4List of Regular Expressions to substitute simple/standart abbreviations
    55
    6 === List of Regular Expression to substitute special abbreviations ===
     6List of Regular Expression to substitute special abbreviations
    77
    8 === List of Regular Expressions to substitute abbreviations with line break ===
     8List of Regular Expressions to substitute abbreviations around a line break
    99
    1010NB: Characters indicating an abbriviation in the raw text (like an o with a tilde stands for om or on) have been substituted with numbers in brackets. If these special characters have an expression according to the Archimedes dtd, they are transformed in the second set. Never the less there are characters which are not covered by the Archimedes dtd and will appeare as numbers in the second set without the brackets. Please note, that sometimes a comma is before these numbers inside an xml-attribute. Unfortunately, it had to be introduced to separate the character from replacement subpatterns, which contain numbers as well.