matchzoo.preprocessors.units.stop_removal¶
Module Contents¶
Classes¶
Process unit to remove stop words. |
-
class
matchzoo.preprocessors.units.stop_removal.StopRemoval(lang: str = 'english')¶ Bases:
matchzoo.preprocessors.units.unit.UnitProcess unit to remove stop words.
Example
>>> unit = StopRemoval() >>> unit.transform(['a', 'the', 'test']) ['test'] >>> type(unit.stopwords) <class 'list'>
-
transform(self, input_: list) → list¶ Remove stopwords from list of tokenized tokens.
- Parameters
input – list of tokenized tokens.
lang – language code for stopwords.
- Return tokens
list of tokenized tokens without stopwords.
-
property
stopwords(self) → list¶ Get stopwords based on language.
- Params lang
language code.
- Returns
list of stop words.
-