Search code examples
filterwekadocument-classification

StringToWordVector filter under weka


My data are passed through StringToWordVector filter. StringToWordVector can output binary presence/absence indicators, word frequencies or TF-IDF scores. what is the default output of this filter under weka?


Solution

  • According to the options specified in this documentation,

    -C
    Output word counts rather than boolean word presence.
    

    the default output is boolean word presence, and that can be changed by passing it arguments such as -C.