generated from scikit-learn-contrib/project-template
-
Notifications
You must be signed in to change notification settings - Fork 26
Open
Description
Wondering your thoughts on something like the Sklearn CountVectorizer option of binary=True for easy use in cases when counts don't matter...I know it's easy to do after the fact, but it could be a nice option to have where it makes sense (like NgramVectorizer). (Maybe it already exists, but I didn't see it). It would make it free to use as part of DocVectorizer as well. Otherwise I think you have to break the whole pipeline apart and "do it by hand" just to binarize a matrix.
Metadata
Metadata
Assignees
Labels
No labels