Overview
Comment: | [graphspell][py] dawg builder: filter entries with regex |
---|---|
Downloads: | Tarball | ZIP archive | SQL archive |
Timelines: | family | ancestors | descendants | both | trunk | graphspell |
Files: | files | file ages | folders |
SHA3-256: |
96692bb88326c20e8587e6e98a30cdcd |
User & Date: | olr on 2018-05-04 08:16:18 |
Other Links: | manifest | tags |
Context
2018-05-04
| ||
10:29 | [build][graphspell] multiple main dictionaries check-in: ca4c833876 user: olr tags: trunk, build, graphspell | |
08:16 | [graphspell][py] dawg builder: filter entries with regex check-in: 96692bb883 user: olr tags: trunk, graphspell | |
2018-05-03
| ||
11:33 | [graphspell][py][bug] fix words selection check-in: 0eaff45865 user: olr tags: trunk, graphspell | |
Changes
Modified graphspell/dawg.py from [63684196d2] to [64364f5bf4].
︙ | |||
37 38 39 40 41 42 43 | 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 | - + + + + + + + + - - - - - - - - - - - - - - - - - - - - - - + + + + + + + + + + + + + + + + + + + + + + | """DIRECT ACYCLIC WORD GRAPH""" # This code is inspired from Steve Hanov’s DAWG, 2011. (http://stevehanov.ca/blog/index.php?id=115) # We store suffix/affix codes and tags within the graph after the “real” word. # A word is a list of numbers [ c1, c2, c3 . . . cN, iAffix, iTags] # Each arc is an index in self.lArcVal, where are stored characters, suffix/affix codes for stemming and tags. # Important: As usual, the last node (after ‘iTags’) is tagged final, AND the node after ‘cN’ is ALSO tagged final. |
︙ |