Grammalecte: history of file graphspell-js/tokenizer.js at check-in cac524f6db61f7be

2020-12-02
07:57
[graphspell] tokenizer update file: [5dde8e1191] check-in: [9678e9208c] user: olr, branch: trunk, size: 5504
2020-11-30
15:15
[graphspell][fx] update tokenizer and lexicographer: add symbols and emojis file: [8e6d24c94a] check-in: [b3448ac17f] user: olr, branch: trunk, size: 5289
2020-11-25
20:50
[graphspell][fr][fx] rename tokens file: [9c02b80583] check-in: [6aae160f81] user: olr, branch: trunk, size: 5079
2020-10-02
09:32
[graphspell] tokenizer: token UNDERSCORE file: [7838839417] check-in: [e2313363fe] user: olr, branch: trunk, size: 5088
2020-10-01
14:50
[graphspell] tokenizer: exclude underscore from WORD token [fr] adjustments, inclusive writing file: [4d88fd06ec] check-in: [cfbaf0ad4e] user: olr, branch: trunk, size: 5018
2020-09-02
09:07
[graphspell] tokenizer: token OTHER as fallback file: [2ede633a83] check-in: [e201630bf5] user: olr, branch: trunk, size: 4970
2020-05-07
10:35
[graphspell] tokenizer and suggestion engine: other apostrophes file: [0e7b889227] check-in: [b68161b398] user: olr, branch: trunk, size: 4909
2020-04-20
18:02
[graphspell] tokenizer: combining diacritics recognition and NFC normalization file: [efabea9cdf] check-in: [3ef2bdb736] user: olr, branch: trunk, size: 4895
2020-01-12
09:33
[graphspell][js] fix tokenizer for HTML markers file: [16e7826100] check-in: [b6c3593f76] user: olr, branch: trunk, size: 4817
2019-09-01
08:22
[graphspell] tokenizer: handles all kinds of apostrophes file: [5a3085a08e] check-in: [1bdedd3133] user: olr, branch: trunk, size: 5025
2019-08-30
09:45
[graphspell] tokenizer: consider presqu’ and quelqu’ as separate words file: [0f58ee1cf3] check-in: [0f0bc77645] user: olr, branch: trunk, size: 5015
2019-07-30
20:06
[graphspell][fr] update tokenizer: ordinals file: [2f6a31a87a] check-in: [dcdb32b057] user: olr, branch: trunk, size: 5001
2019-06-09
06:32
[graphspell] tokenizer: update HOUR file: [282849cd0e] check-in: [1bc78ce87f] user: olr, branch: trunk, size: 4969
2019-05-22
08:24
[graphspell][js] tokenizer: tag SEPARATOR -> PUNC file: [3e1b39918e] check-in: [75bf92c9c2] user: olr, branch: trunk, size: 4939
07:59
[core][graphspell][js] fix regex for \w substitution file: [5b1f96af0f] check-in: [e40149ad94] user: olr, branch: trunk, size: 4949
2019-05-14
15:19
[graphspell] tokenizer: update for HOUR tokens file: [4d861c8e0e] check-in: [63672ef096] user: olr, branch: trunk, size: 4949
2019-05-02
08:16
[graphspell] tokenizer: update file: [f8ee073676] check-in: [7d30bbec37] user: olr, branch: trunk, size: 4946
07:50
[graphspell] tokenizer: update file: [d9b8ecbdba] check-in: [ed3b7acf68] user: olr, branch: trunk, size: 4918
2019-02-22
11:53
[graphspell][fr] tokenization: +signs €$# (false positive) file: [c05f88b98c] check-in: [365d3554c7] user: olr, branch: trunk, size: 4900
2018-12-26
18:25
[graphspell][js] fucking \w substitution again file: [1d91386df2] check-in: [7f03f6c55a] user: olr, branch: trunk, size: 4890
18:10
[graphspell][js] fucking \w substitution again file: [eb39282111] check-in: [78254a6629] user: olr, branch: trunk, size: 4874
2018-10-11
15:13
[js] Revert syntax change file: [1dc7e255ce] check-in: [9ae8f0a042] user: IllusionPerdu, branch: nodejs, size: 4864
2018-10-10
09:19
Some changes to JavaScript to work in Node file: [541689c69f] check-in: [a3687f4fd3] user: IllusionPerdu, branch: nodejs, size: 4863
2018-09-17
12:06
[graphspell][js] tokenizer: update \w replacement again file: [88bacac87d] check-in: [72f63bddd2] user: olr, branch: rg, size: 4819
10:26
[graphspell] tokenizer: new regex for ordinals (JS still sucks) file: [d579c92281] check-in: [a479b36582] user: olr, branch: rg, size: 4815
09:00
[graphspell] tokenizer: add chars to \w replacement (JS still sucks) file: [49d1f9c490] check-in: [c185b3fc04] user: olr, branch: rg, size: 4803
2018-09-14
09:48
[graphspell][js] tokenizer: fix var init file: [aac6560c8a] check-in: [52d64fd395] user: olr, branch: rg, size: 4775
2018-09-11
18:55
[graphspell][js] tokenizer: don’t use spaces as tokens, yield information token (start/end) file: [d2289444f9] check-in: [d12872816f] user: olr, branch: rg, size: 4799
2018-08-07
08:09
[graphspell][core][fx][tb][js] get rid of console.log() substitutes, assuming now default console is available from everywhere (eh, print now works as expected in JS! The JS pile of shit is decreasing a little) file: [04c83f2aa3] check-in: [8e5a2f5900] user: olr, branch: trunk, size: 4322
2018-07-17
06:42
[graphspell] tokenizer: remove hyphen in number detection (always considered as a separate sign) file: [4a5b091820] check-in: [6950f5898f] user: olr, branch: rg, size: 4387
2018-06-28
08:26
[graphspell][core] tokenizer: rename ACRONYM tokens to WORD_ACRONYM file: [2fadfb42f5] check-in: [ccbbecbd1b] user: olr, branch: rg, size: 4391
08:00
[graphspell] tokenizer: rename ORDINAL tokens to WORD_ORDINAL file: [a185b00a68] check-in: [20dbc28ded] user: olr, branch: rg, size: 4381
07:53
[graphspell][core] tokenizer: rename ELPFX tokens to WORD_ELIDED file: [8dd855b1b3] check-in: [a1b165e276] user: olr, branch: rg, size: 4376
2018-06-18
20:13
[graphspell][fix] tokenizer: new signs file: [5f94dc04ea] check-in: [303c416286] user: olr, branch: rg, size: 4370
20:12
[graphspell] tokenizer: new signs file: [1ab7d18bed] check-in: [da0d308818] user: olr, branch: rg, size: 4372
2018-06-17
13:11
[graphspell] tokenizer: update ordinals file: [2e21533239] check-in: [4be13a74c3] user: olr, branch: rg, size: 4280
2018-05-18
13:11
[graphspell] tokenizer: add token index and avoid punctuations aggregation file: [9bd60cca8a] check-in: [be6d99bbdc] user: olr, branch: rg, size: 4232
2018-02-20
08:40
[graphspell] spellchecker: add parseParagraph() file: [bdd895b918] check-in: [7616aa7ef9] user: olr, branch: trunk, size: 4443
2018-02-13
15:44
[core][cli][server][graphspell][fx] use spellchecker instead of ibdawg file: [c3f0ee8c90] check-in: [18db5d65f0] user: olr, branch: multid, size: 4756
2017-12-25
08:52
[build][tb] fix imports for Thunderbird file: [d6429837c4] check-in: [5822c38ee9] user: olr, branch: graphspell, size: 4740
2017-12-24
18:58
Renamed gc_core/js/tokenizer.js → graphspell-js/tokenizer.js. [build][js] move files from gc_core to graphspell file: [a34a81c6e5] check-in: [bdfc6fd5e9] user: olr, branch: graphspell, size: 4729
2017-11-12
13:22
[core][fx] tokenizer: +acronyms file: [a34a81c6e5] check-in: [fa1205c098] user: olr, branch: Lexicographe, size: 4729
2017-11-02
10:58
Add ~ to the tokenizer's detection of Linux folders, and distinguish between the two folder types (Windows/Linux), with the corresponding change in the lexicographer file: [9d996e312d] check-in: [11f1414b5b] user: IllusionPerdu, branch: Lexicographe, size: 4613
2017-10-26
05:49
[core] tokenizer: better regex for URLs and folders file: [9bb6ea03fb] check-in: [843c0244bc] user: olr, branch: trunk, size: 4601
2017-10-25
18:34
[core][bug] fix tokenizer for URL file: [de468b4358] check-in: [ee7d44a3ee] user: olr, branch: trunk, size: 4583
2017-10-24
11:00
[core] tokenization: folders file: [9622b0a610] check-in: [35c48d42a8] user: olr, branch: trunk, size: 4561
2017-08-08
19:11
[core] add inline config for JSHint assuming ES6 syntax and some globals file: [81088a0177] check-in: [b89dc82bc4] user: IllusionPerdu, branch: webext2_fix, size: 3799
16:49
[core][fr] change typeof(exports) before the require block file: [c0e5d0e035] check-in: [6c8ee6edcf] user: IllusionPerdu, branch: webext2_fix, size: 3748
2017-08-06
00:44
[core][js] add forgotten semicolons and remove superfluous ones (erroneous commit) file: [fcd058bf6a] check-in: [b08f2ef338] user: IllusionPerdu, branch: webext2_illusion, size: 3748
2017-08-02
06:12
[core][js] conditional requires and variable renaming to avoid overriding file: [02266194ef] check-in: [7e545790c3] user: olr, branch: webext2, size: 3749
2017-07-29
08:06
[core][js] test if <exports> exists file: [4c307bc446] check-in: [c55ae247e7] user: olr, branch: webext, size: 3044
2017-07-27
10:10
[core][js] test if variable <exports> exists file: [a6594366c3] check-in: [f9a034e6ce] user: olr, branch: trunk, size: 3695
2017-07-13
07:08
[core][js] tokenizer: yield separator one by one file: [8720e6c367] check-in: [492169bd6f] user: olr, branch: kill_innerHTML, size: 3649
2017-07-12
11:03
[fr][js] update: lexicographer, tokenizer file: [4eb5311054] check-in: [074cb33c80] user: olr, branch: kill_innerHTML, size: 3338
2017-04-25
11:51
Added: commit 1 file: [d06151ccd7] check-in: [2fd7dc4dd5] user: olr, branch: trunk, size: 2998