Extract features from texts
What the MR does
- added module to extract important features (language, words, sentences...) from texts (documented and tested)
- notebook checking generation & analysing texts (originals and generated).
Checklist
-
Linting and typing are OK, -
Licenses are OK (licensecheck + integration with reuse with the proper licenses), -
You have tested this MR locally, -
You have considered performance issues, -
You have considered availability and reliability risks, -
You have updated documentation if necessary.
Linked to issue #19