Grammalecte  Check-in [660df51ea1]

Overview
Comment:[fr] faux positif (ocr)
Downloads: Tarball | ZIP archive | SQL archive
Timelines: family | ancestors | descendants | both | trunk | fr
Files: files | file ages | folders
SHA3-256: 660df51ea13d45baab0277e7d72373f6c450a52ed30dbb380fcacaae00083567
User & Date: olr on 2019-04-16 16:29:39
Other Links: manifest | tags
Context
2019-04-16
16:42
[fr] faux positif check-in: f42e3d0dc9 user: olr tags: trunk, fr
16:29
[fr] faux positif (ocr) check-in: 660df51ea1 user: olr tags: trunk, fr
16:05
[core] update: paragraph splitting check-in: d0bd3382eb user: olr tags: trunk, core
Changes

Modified gc_lang/fr/rules.grx from [21dbe41916] to [d1d7d695cd].

2191
2192
2193
2194
2195
2196
2197
2198

2199
2200
2201
2202
2203
2204
2205
2191
2192
2193
2194
2195
2196
2197

2198
2199
2200
2201
2202
2203
2204
2205







-
+







@@@@
@@@@

# This graph is parsed only if option <ocr> is activated.

## Casse
__ocr_casse__
    ~^[A-ZÉÈÂÊÎÔ].
    ~^[A-ZÉÈÂÊÎÔ]\w
        <<- \1.istitle() and before("\\w") and (morph(\1, ":G", ":M") or morph(\1, ":[123][sp]", ":[MNA]|>Est/"))
        -:>> =\1.lower()                                                                            # Erreur de numérisation ? Casse improbable.

    [L’|D’|N’|M’|T’|S’|C’|Ç’]  *WORD
        <<- before("\\w") and not \2.isupper() -1:>> =\1.lower()                                    # Erreur de numérisation ? Casse douteuse.

TEST: __ocr__ regarde dans l’obscurité de {{L’}}entrepôt                        ->> l’