scikit (sklearn)#

Scikit-learn is a Python package for machine learning, implementing various interfaces for deploying vectorizers.

Dependencies#

No additional depedencies need to be installed to use scikit-learn. It works out of the box!

API#

class vecworks.vectorizers.scikit.sklearnVectorizer#

Wrapper class to ease use of scikit-learn vectorizers in Vecworks.

Interface#

__init__(vectorizer: TransformerMixin | Pipeline, fit: Iterable[str] | None = None)#

Initializes the vectorizer.

Parameters#

vectorizer

Sci-kit learn transformer to be used to vectorize the data.

fit

Optional, data with which the transformer may be fit.

transform(input: Any | Iterable[Any]) ndarray | sparray#

Vectorizes the given data.

Also see: Vectorizer.