Grammalecte  Check-in [ecdbcd1539]

Overview
Comment:[build] màj: build_data
Downloads: Tarball | ZIP archive | SQL archive
Timelines: family | ancestors | descendants | both | trunk | build
Files: files | file ages | folders
SHA3-256: ecdbcd1539a2283891c81aa22d023e1488d0d0232b0ee77a5e4320aebcaf644b
User & Date: olr on 2017-06-23 23:09:46
Other Links: manifest | tags
Context
2017-06-23
23:23
[fr] phonet_simil: simplification des lignes contenant des verbes du 1ᵉʳ groupe à l’infinitif check-in: a1812bdacc user: olr tags: trunk, fr
23:09
[build] màj: build_data check-in: ecdbcd1539 user: olr tags: trunk, build
22:46
[build] more options for build_data check-in: 891cbb4e9d user: olr tags: trunk, build
Changes

Modified gc_lang/fr/build_data.py from [79b36913b6] to [89b72162ae].

266
267
268
269
270
271
272
273

274
275

276
277
278
279



280
281
282
283
284
285
286
266
267
268
269
270
271
272

273
274

275
276
277


278
279
280
281
282
283
284
285
286
287







-
+

-
+


-
-
+
+
+







        return

    with open(sp+"/data/phonet_simil.txt", 'r', encoding='utf-8') as hSrc:
        # set of homophonic words
        lSet = []
        for sLine in hSrc.readlines():
            if not sLine.startswith("#") and sLine.strip():
                aWord = set(sLine.strip().split())
                lWord = sLine.strip().split()
                aMore = set()
                for sWord in aWord:
                for sWord in lWord:
                    if sWord.endswith("er") and conj.isVerb(sWord):
                        aMore = aMore.union(conj.getConjSimilInfiV1(sWord))
                aWord = aWord.union(aMore)
                lSet.append(aWord)
                lWord.extend(list(aMore))
                lSet.append(lWord)
                #print(lWord)
        # dictionary of words
        dWord = {}
        for i, aSet in enumerate(lSet):
            for sWord in aSet:
                if oDict.lookup(sWord):
                    dWord[sWord] = i  # warning, what if word in several sets?
                else:

Modified gc_lang/fr/modules/phonet_data.py from [98fafd6c52] to [63ece6deca].

cannot compute difference between binary files