Nebulento

A lightweight fuzzy-matching intent parser built on rapidfuzz.

Finds the closest matching intent by comparing the utterance against all training sentences using configurable fuzzy similarity strategies. Handles spelling errors, word-order variation, contractions, and natural phrasing that exact-match parsers would miss. Best suited for small-to-medium intent sets (dozens to hundreds of training sentences per intent).

Install

pip install nebulento

For the OVOS pipeline plugin:

pip install "nebulento[ovos]"

Quick start

from nebulento import IntentContainer, MatchStrategy

container = IntentContainer(fuzzy_strategy=MatchStrategy.TOKEN_SET_RATIO)

container.add_intent("hello", ["hello", "hi", "how are you", "what's up"])
container.add_intent("buy", ["buy {item}", "purchase {item}", "get {item} for me"])
container.add_entity("item", ["milk", "cheese"])

container.calc_intent("hello")
# {'name': 'hello', 'conf': 1.0, 'entities': {}, 'best_match': 'hello',
#  'utterance': 'hello', 'utterance_consumed': 'hello', 'utterance_remainder': '',
#  'match_strategy': 'TOKEN_SET_RATIO'}

container.calc_intent("buy milk")
# {'name': 'buy', 'conf': 0.719, 'entities': {'item': ['milk']},
#  'best_match': 'buy {item}', ...}

Template syntax

| Syntax | Meaning |
|--------|---------|
| `(one\|of\|these)` | Alternation: expands to one variant per combination |
| `[optional]` | Optional word or phrase |
| `{entity}` | Capture group: matched against registered entity samples |
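
To illustrate what "one variant per combination" means, here is a minimal sketch of template expansion. It is not Nebulento's implementation: the helper name `expand_template` is hypothetical, nested groups are not handled, and the order of the generated variants is an assumption. `{entity}` slots are left untouched because they are resolved at match time, not at expansion time.

```python
import itertools
import re


def expand_template(template: str) -> list[str]:
    """Expand (a|b) alternations and [optional] parts into plain sentences.

    Hypothetical helper for illustration; assumes groups are not nested.
    """
    # Rewrite [x] as (x|) so optional parts reuse the alternation logic.
    template = re.sub(r"\[([^\[\]]*)\]", r"(\1|)", template)
    # Splitting on a capturing group keeps the groups: literal text sits at
    # even indices, "(...)" groups at odd indices.
    parts = re.split(r"(\([^()]*\))", template)
    choices = [
        part[1:-1].split("|") if index % 2 else [part]
        for index, part in enumerate(parts)
    ]
    # One sentence per combination, with whitespace collapsed afterwards.
    return [
        " ".join("".join(combo).split())
        for combo in itertools.product(*choices)
    ]


for sentence in expand_template("(turn|switch) on [the] lights"):
    print(sentence)
# turn on the lights
# turn on lights
# switch on the lights
# switch on lights
```

A template with two alternatives and one optional word therefore contributes four training sentences, which is worth keeping in mind when sizing an intent.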

Match strategies

Choose a strategy via IntentContainer(fuzzy_strategy=MatchStrategy.X).

| Strategy | Best for | False-positive risk |
|----------|----------|---------------------|
| `DAMERAU_LEVENSHTEIN_SIMILARITY` | Spelling errors, zero false positives | Low (default) |
| `TOKEN_SET_RATIO` | Natural phrasing, word-order variation | High |
| `SIMPLE_RATIO` | General use, balanced recall/precision | Medium |
| `TOKEN_SORT_RATIO` | Same words, different order | Medium |
| `PARTIAL_RATIO` | Substring presence; avoid for intent gating | Very high |

See docs/strategies.md for the full comparison table and benchmark rows.
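
To make the trade-offs in the table more concrete, the sketch below re-implements three of the strategies using only Python's standard-library `difflib`. It is a simplified stand-in, not rapidfuzz's scoring and not Nebulento's code; the function names are illustrative and the absolute scores will differ from what the library reports. What it does show is why token-set matching is forgiving (and therefore riskier): words that appear in only one of the two sentences can be ignored entirely.

```python
from difflib import SequenceMatcher


def simple_ratio(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] (difflib stand-in)."""
    return SequenceMatcher(None, a, b).ratio()


def token_sort_ratio(a: str, b: str) -> float:
    """Sort the words first, so word order no longer matters."""
    return simple_ratio(" ".join(sorted(a.split())), " ".join(sorted(b.split())))


def token_set_ratio(a: str, b: str) -> float:
    """Compare the shared words against each side; keep the best score."""
    words_a, words_b = set(a.split()), set(b.split())
    common = " ".join(sorted(words_a & words_b))
    side_a = (common + " " + " ".join(sorted(words_a - words_b))).strip()
    side_b = (common + " " + " ".join(sorted(words_b - words_a))).strip()
    if not common:
        # No shared words: fall back to a plain comparison.
        return simple_ratio(side_a, side_b)
    return max(
        simple_ratio(common, side_a),
        simple_ratio(common, side_b),
        simple_ratio(side_a, side_b),
    )


training = "turn on the lights"
for utterance in ("lights on the turn", "turn on the lights please"):
    print(
        utterance,
        round(simple_ratio(utterance, training), 2),
        round(token_sort_ratio(utterance, training), 2),
        round(token_set_ratio(utterance, training), 2),
    )
```

In this sketch the reordered utterance scores 1.0 under both token-based strategies, and the utterance with the extra word "please" still scores 1.0 under the token-set strategy because the training sentence's words are a subset of it. The same leniency is what lets unrelated utterances that merely share a few words slip through.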

OVOS pipeline plugin

Nebulento ships as an OVOS pipeline plugin (ovos-nebulento-pipeline-plugin).

{
  "intents": {
    "pipeline": [
      "ovos-nebulento-pipeline-plugin"
    ]
  }
}

Configure the fuzzy strategy and confidence thresholds:

{
  "intents": {
    "nebulento": {
      "strategy": "TOKEN_SET_RATIO",
      "conf_high": 0.95,
      "conf_med":  0.80,
      "conf_low":  0.50
    }
  }
}
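
One plausible reading of the three thresholds is as tier boundaries for the `conf` value returned by a match. The sketch below is a hypothetical helper, not the plugin's code: the function name, the tier labels, and the inclusive `>=` comparisons are all assumptions, and only the default numbers are taken from the configuration above.

```python
from typing import Optional


def confidence_tier(
    conf: float,
    conf_high: float = 0.95,
    conf_med: float = 0.80,
    conf_low: float = 0.50,
) -> Optional[str]:
    """Map a match confidence to a tier name, or None if below every tier.

    Assumption: thresholds are inclusive lower bounds, checked high to low.
    """
    if conf >= conf_high:
        return "high"
    if conf >= conf_med:
        return "medium"
    if conf >= conf_low:
        return "low"
    return None


# Confidence values taken from the quick-start results.
print(confidence_tier(1.0))    # high
print(confidence_tier(0.719))  # low
```

Under these assumptions, raising `conf_low` is the simplest lever for trading recall for fewer false positives when a lenient strategy such as `TOKEN_SET_RATIO` is configured.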

Entry point: nebulento.opm:NebulentoPipeline

Documentation

| Page | Description |
|------|-------------|
| Quickstart | 5-minute guide: intents, entities, strategies |
| Intent API | Full `IntentContainer` and `DomainIntentContainer` reference |
| Match Strategies | All 9 strategies with benchmark data and decision table |
| Template Syntax | `(a\|b)`, `[opt]`, `{slot}`, `:0` padatious syntax, expansion rules |
| Entity Extraction | Registration, confidence boost, result fields |
| Normalisation | Apostrophes, whitespace, case handling |
| Domain Matching | `DomainIntentContainer` two-stage matching |
| OVOS Pipeline Plugin | Bus events, confidence tiers, comparison with Padatious |
| Configuration | All config keys with types, defaults, and effect |
| Benchmark | Full accuracy results across all strategies |
| Troubleshooting | False positives, low recall, entity issues, lru_cache gotchas |

Benchmark

268 test cases: 244 natural human utterances across 22 intents, plus 24 deliberate no-match cases.

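| Engine | Accuracy | Precision | Recall | F1 | False positives | Median latency |
|--------|----------|-----------|--------|----|-----------------|----------------|
| padaos (regex) | 25.4% | 100% | 18.0% | 0.306 | 0 / 24 | 0.07 ms |
| padatious (neural) | 53.4% | 96.9% | 50.4% | 0.663 | 4 / 24 | 1.1 ms |
| nebulento `token-set-ratio` | 50.4% | 88.3% | 52.5% | 0.658 | 17 / 24 | 6.3 ms |
| nebulento `damerau-levenshtein` | 38.8% | 100% | 32.8% | 0.494 | 0 / 24 | 6.8 ms |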

python benchmark/compare.py
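
The F1 column is the harmonic mean of precision and recall, so it can be re-derived from those two columns. The sketch below does that for three rows of the table. The helper name `f1_score` is illustrative and is not taken from `benchmark/compare.py`. Because the inputs are the already-rounded percentages, the last digit can differ from the published value for some rows (the padaos row, for instance, comes out as 0.305 this way).

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs copied from the benchmark table as fractions.
rows = {
    "padatious (neural)": (0.969, 0.504),
    "nebulento token-set-ratio": (0.883, 0.525),
    "nebulento damerau-levenshtein": (1.000, 0.328),
}

for engine, (precision, recall) in rows.items():
    print(f"{engine}: F1 = {f1_score(precision, recall):.3f}")
# padatious (neural): F1 = 0.663
# nebulento token-set-ratio: F1 = 0.658
# nebulento damerau-levenshtein: F1 = 0.494
```

F1 only reflects behaviour on utterances that should match; the false-positive column (for example 17 of 24 no-match cases for `token-set-ratio`) has to be read alongside it.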

Credits

Originally an experimental research project by TigreGoticoLda, polished and donated to OpenVoiceOS as part of the NLnet NGI0 Commons Fund under grant agreement No 101135429.

License

Apache 2.0
