Load text Files to python weka wrapper -
i've installed weka python wrapper on windows 7. , tried running sample code:
import weka.core.jvm jvm jvm.start() data_dir = "e:/files/fourth/" weka.core.converters import loader loader = loader("weka.core.converters.textdirectoryloader") datasets = [ data_dir + "file 1", data_dir + "file 2", data_dir + "file 3", data_dir + "file 4", data_dir + "file 5" ] data = loader.load_file(datasets) data.delete_last_attribute() print(data)
and received following error:
traceback (most recent call last): file "c:/python27/weekaa.py", line 16, in <module> data = loader.load_file(datasets) file "c:\python27\lib\site-packages\weka\core\converters.py", line 67, in load_file self.enforce_type(self.jobject, "weka.core.converters.filesourcedconverter") file "c:\python27\lib\site-packages\weka\core\classes.py", line 155, in enforce_type raise typeerror("object not implement or subclass " + intf_or_class + "!") typeerror: object not implement or subclass weka.core.converters.filesourcedconverter!
i tried solution in previous asked question (in stackoverflow) adding class path weka.jar or python-weka-wrapper didn't work. error doesn't appear when loading .arff file type.
is there solution load text files?
note: each file in dataset has set of text document files(for later clustering)
weka's textdirectoryloader
class cannot used python-weka-wrapper version 0.2.2. upcoming release 0.2.3 (or github repository) contain new python wrapper called textdirectoryloader
available weka.core.converters
module allows use class now. answered on python-weka-wrapper mailing list.
from weka.core.converters import textdirectoryloader text_dir = "/the/directory/you/want/to/load" loader = textdirectoryloader(options=["-dir", text_dir, "-f", "-charset", "utf-8"]) data = loader.load() print(unicode(data))
Comments
Post a Comment