1f45489cab
replace newlines with spaces
...
whereas before they were simply discarded
2021-12-20 05:24:08 +01:00
4ee9cf5af6
fix(text parsing): actually perform clean-up
2021-12-20 05:21:00 +01:00
e2ca88c247
feat: make this project a module
...
* update .gitignore
* add __init__.py
* add main.py
* add test strings of both smaller and greater size
* call what_this method with test strings as parameters
* call what_this without the text parameter to demonstrate there is a
fallback
2021-12-20 04:57:48 +01:00
94dc7cf46f
add what_this method that interfaces with the user
2021-12-20 04:53:07 +01:00
bd1755d745
make freqs_folder a class field
2021-12-20 04:52:10 +01:00
695e700e35
add attempt_detection method
2021-12-20 04:34:08 +01:00
96bca80d1a
chore: stop printing r_ngrams
2021-12-20 04:24:46 +01:00
77527cb81f
chore: stop calling "internal" methods directly
...
internal in a way they're not supposed to be called directly
2021-12-20 04:23:35 +01:00
ac3a130c92
make missing n-grams have greater overall impact
2021-12-20 04:00:16 +01:00
981fc419bd
chore: add fields with defaults
2021-12-20 03:58:45 +01:00
21a87bbfec
fix: have stable n-grams picked once per pass
...
..so that the detection is performed on the same set for every lang
2021-12-20 02:35:56 +01:00
bda5b336ed
add gimme_probabs_multi_lang
...
this method operates on a list of language probability dictionaries,
passes most of the parameters to other instance methods defined earlier
2021-12-20 01:48:05 +01:00
f09d4177c7
add a "try" wrapper for when getting probabs
2021-12-20 01:36:44 +01:00
789ccd5856
add gimme_probabilities and replace_nones
...
methods for "converting" a list of ngrams into a list of probabilities
these ngrams occur in a given language, replacing "None" values on the
fly with a very low "probability of occurence" value
2021-12-20 01:20:22 +01:00
3650d89fe1
chore: actually return the list of random n-grams
2021-12-20 01:04:55 +01:00
fce2c7c0de
chore: set insanity_threshold dynamically
2021-12-20 00:58:10 +01:00
c6215e6146
add basic checks for langs_to_check, rm local var
2021-12-20 00:50:27 +01:00
1967871779
chore: add a hint to langs_to_check
2021-12-20 00:49:59 +01:00
e942f58f73
fix: by default pick from at least two languages
2021-12-20 00:45:05 +01:00
225b683541
pick_ngrams: complete the implementation
2021-12-17 17:32:49 +01:00
b56662dd87
pick_ngrams: add param how_many + type checks
...
also add a test string test_str
2021-12-17 17:24:04 +01:00
294c16c110
chore: add base of pick_ngrams() method
2021-12-16 05:06:13 +01:00
86f4c3e54d
add type hint to path
2021-12-16 04:59:27 +01:00
dd2f86708b
instantiate da_detector and load jsons
...
for reference purposes, add the json files
2021-12-16 04:42:16 +01:00
0c2b838125
chmod +x, add #! entrypoint
2021-12-16 04:36:46 +01:00
c524ec93d8
add vim metadata
2021-12-16 04:35:13 +01:00
46659d9bd5
add parse_freqs() method
2021-12-16 04:34:24 +01:00
b31df6934e
optimise: mv the try block one level up
2021-12-16 04:29:47 +01:00
e361274b89
da_detector: have a fallback param value of ["sk"]
2021-12-16 04:27:50 +01:00
128db6d440
actually do return the processed data
2021-12-16 03:17:57 +01:00
ef91b2fca7
import just what's needed
2021-12-16 03:17:47 +01:00
e141f2c7b0
ignore vim swp files
2021-12-16 03:17:30 +01:00
c68db63fc8
initial commit
2021-12-16 03:16:47 +01:00