Tuesday, May 21, 2013

RapidMiner Compared with Other Data Mining Software

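I found another website that lists and compares the common open source data mining packages on the market. I don't have time to translate it, so I'm posting the links here for anyone interested to read on their own.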

Website: AI Computer Vision

Comparison 1

Comparison 2

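Compared with the very pricey SPSS and SAS, RapidMiner rates above average in every respect (e.g., document handling, graphical interface, ease of use, command syntax, and algorithms).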

Below I've pasted the author's list of pros and cons directly, for your reference.

RapidMiner

RapidMiner is an open source statistical and data mining package written in Java.
  • Solid and complete package.
  • It easily reads and writes Excel files and different databases.
  • You program by piping components together in graphical ETL workflows.
  • If you set up an illegal workflow, RapidMiner suggests Quick Fixes to make it legal.
Issues:
  • I only got it to work under Windows, but others have gotten it to work in other environments, see comment below.
  • There are a lot of different ETL modules; it took a while to understand how to use them.
  • At first I had a hard time comparing different models. Eventually I found a way: you choose a cross validation and select different models one by one. When you run the models, they will all be stored on the result page and you can compare them there.
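
The last point, running several models through the same cross validation and comparing the results side by side, can be illustrated outside RapidMiner. The following is only a minimal Python sketch of that idea, not RapidMiner's own workflow or API. The toy data, the two simple models, and every function and class name here are assumptions made up for illustration.

```python
import random
from collections import Counter


def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


class MajorityClass:
    """Baseline model: always predicts the most common training label."""

    def fit(self, X, y):
        self.label = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, x):
        return self.label


class NearestNeighbor:
    """1-nearest-neighbor model using squared Euclidean distance."""

    def fit(self, X, y):
        self.X, self.y = X, y
        return self

    def predict(self, x):
        dists = [sum((a - b) ** 2 for a, b in zip(row, x)) for row in self.X]
        return self.y[dists.index(min(dists))]


def cross_val_accuracy(make_model, X, y, folds):
    """Mean accuracy over the folds; each fold is held out exactly once."""
    scores = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = make_model().fit([X[i] for i in train_idx],
                                 [y[i] for i in train_idx])
        correct = sum(model.predict(X[i]) == y[i] for i in test_idx)
        scores.append(correct / len(test_idx))
    return sum(scores) / len(scores)


# Hypothetical toy data: two well-separated clusters of 25 points each.
X = [[i * 0.1, j * 0.1] for i in range(5) for j in range(5)]
X += [[5 + i * 0.1, 5 + j * 0.1] for i in range(5) for j in range(5)]
y = [0] * 25 + [1] * 25

# Every model is scored on the same folds, so the numbers are comparable.
folds = k_fold_indices(len(X), k=5)
candidates = {"MajorityClass": MajorityClass, "NearestNeighbor": NearestNeighbor}
results = {name: cross_val_accuracy(cls, X, y, folds)
           for name, cls in candidates.items()}

# Collect all results in one place and list them from best to worst.
for name, acc in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {acc:.2f}")
```

Reusing the same folds for every model plays the role of the shared cross validation described above, and the `results` dictionary stands in for the result page where the models are compared.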

I hope the information above is helpful for your learning.
