Grammalecte  Check-in [ecdbcd1539]

Overview
Comment: [build] update: build_data
SHA3-256: ecdbcd1539a2283891c81aa22d023e1488d0d0232b0ee77a5e4320aebcaf644b
User & Date: olr on 2017-06-23 23:09:46
Context
2017-06-23
23:23
[fr] phonet_simil: simplification of lines containing first-group verbs in the infinitive check-in: a1812bdacc user: olr tags: trunk, fr
23:09
[build] update: build_data check-in: ecdbcd1539 user: olr tags: trunk, build
22:46
[build] more options for build_data check-in: 891cbb4e9d user: olr tags: trunk, build
Changes

Modified gc_lang/fr/build_data.py from [79b36913b6] to [89b72162ae].

Old version [79b36913b6], lines 266 to 286:

        return

    with open(sp+"/data/phonet_simil.txt", 'r', encoding='utf-8') as hSrc:
        # set of homophonic words
        lSet = []
        for sLine in hSrc.readlines():
            if not sLine.startswith("#") and sLine.strip():
                aWord = set(sLine.strip().split())
                aMore = set()
                for sWord in aWord:
                    if sWord.endswith("er") and conj.isVerb(sWord):
                        aMore = aMore.union(conj.getConjSimilInfiV1(sWord))
                aWord = aWord.union(aMore)
                lSet.append(aWord)

        # dictionary of words
        dWord = {}
        for i, aSet in enumerate(lSet):
            for sWord in aSet:
                if oDict.lookup(sWord):
                    dWord[sWord] = i  # warning, what if word in several sets?
                else:







New version [89b72162ae], lines 266 to 287 (changed: 273, 275, 278, 279; added: 280):

        return

    with open(sp+"/data/phonet_simil.txt", 'r', encoding='utf-8') as hSrc:
        # set of homophonic words
        lSet = []
        for sLine in hSrc.readlines():
            if not sLine.startswith("#") and sLine.strip():
                lWord = sLine.strip().split()
                aMore = set()
                for sWord in lWord:
                    if sWord.endswith("er") and conj.isVerb(sWord):
                        aMore = aMore.union(conj.getConjSimilInfiV1(sWord))
                lWord.extend(list(aMore))
                lSet.append(lWord)
                #print(lWord)
        # dictionary of words
        dWord = {}
        for i, aSet in enumerate(lSet):
            for sWord in aSet:
                if oDict.lookup(sWord):
                    dWord[sWord] = i  # warning, what if word in several sets?
                else:
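
The practical effect of switching from a set to a list can be illustrated with a small standalone sketch. It is not the Grammalecte code: `is_verb` and `similar_forms` are hypothetical stand-ins for `conj.isVerb` and `conj.getConjSimilInfiV1`, the sample verb forms are made up for illustration, and the `oDict.lookup` filter is left out. Under those assumptions, the list variant keeps the words in source-line order but can hold the same word twice, and the index-building loop shows what the "word in several sets" warning amounts to: the last group wins.

```python
# Hypothetical stand-in data for the conjugation module; the real
# conj.isVerb / conj.getConjSimilInfiV1 are not shown in this diff.
FAKE_CONJ = {"chanter": {"chanté", "chantez", "chantai"}}


def is_verb(word):
    """Stand-in for conj.isVerb (assumption)."""
    return word in FAKE_CONJ


def similar_forms(word):
    """Stand-in for conj.getConjSimilInfiV1 (assumption)."""
    return FAKE_CONJ.get(word, set())


def collect_extra_forms(words):
    """Gather the similar-sounding forms of every '-er' verb in words."""
    more = set()
    for word in words:
        if word.endswith("er") and is_verb(word):
            more |= similar_forms(word)
    return more


def expand_as_set(line):
    """Old variant: everything merged into one set (no order, no duplicates)."""
    words = set(line.split())
    return words | collect_extra_forms(words)


def expand_as_list(line):
    """New variant: source order kept, extra forms appended at the end."""
    words = line.split()
    words.extend(collect_extra_forms(words))
    return words


def index_words(groups):
    """Map each word to a group index; a later group overwrites an earlier one."""
    index = {}
    for i, group in enumerate(groups):
        for word in group:
            index[word] = i
    return index


old_group = expand_as_set("chanter chanté")
new_group = expand_as_list("chanter chanté")
word_index = index_words([["a", "b"], ["b", "c"]])
```

With this sample data, `new_group` starts with the two words in the order they appear on the line, and `"chanté"` occurs twice because it is both on the line and among the generated forms. Whether such duplicates arise in practice depends on what the real conjugation function returns.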

Modified gc_lang/fr/modules/phonet_data.py from [98fafd6c52] to [63ece6deca].

cannot compute difference between binary files