Commit graph

8 commits

Author SHA1 Message Date
02ea4cc717
chg: [doc] README improved + funky logo 2020-10-09 21:52:00 +02:00
98a8d8275e
chg: [output] make readable table-like with bold headers
Official request from @C00kie-
2020-10-09 18:36:33 +02:00
10049a69b6
new: [option] --binary to dump in binary format instead of UTF-8 2020-10-08 23:30:57 +02:00
26244739dd
new: [option] Don't flush the redisdb, useful when you want to process multiple files and aggregate the results. 2020-10-08 23:22:00 +02:00
949e41d19f
new: [lemmatized/verbatim] displaying verbatim or lemmatized version is now an option 2020-10-08 23:13:51 +02:00
526f88071c
new: [feature] -s option to display the overall statistics of different tokens seen 2020-08-20 13:28:49 +02:00
dd7c796460
new: [napkin] first release
Napkin is a Python tool to produce statistical analysis of a text.

Analysis features are :

- Verbs frequency
- Nouns frequency
- Digit frequency
- Labels frequency such as (Person, organisation, product, location) as defined in spacy.io [named entities](https://spacy.io/api/annotation#named-entities)
- URL frequency
- Email frequency
- Mention frequency (everything prefixed with an @ symbol)
- Out-Of-Vocabulary (OOV) word frequency meaning any words outside English dictionary
2020-08-19 17:33:04 +02:00
e3e27c7ce9
Initial commit 2020-08-18 16:49:24 +02:00