lexicon.low-lexical-diversity
What it flags
Passages with excessive repetition of content words. A monotonous text loses reader attention and often signals unstructured thinking. The rule is not an anti-jargon detector: technical terms (API, request, cache) are expected to repeat โ the signal targets non-technical content words.
At a glance
| Category | lexicon |
| Default severity | info |
| Default weight | 1 |
| Languages | EN ยท FR (separate stoplists) |
| Source | src/rules/low_lexical_diversity.rs |
Detection
Sliding window of window_size words. Within the window, compute unique_words / total_words over non-stopword, non-code-block tokens. Fire when the ratio falls below min_ratio.
Parameters
| Key | Type | dev-doc | public | falc |
|---|---|---|---|---|
window_size | int | 100 | 100 | 80 |
min_ratio | float | 0.40 | 0.50 | 0.55 |
use_stoplist | bool | true | true | true |
Suppression
References
See References for the full bibliography.