天天看點

【資料集】資料集總結

1.http://snap.stanford.edu/index.html。

2.http://www-personal.umich.edu/~mejn/

3.http://deim.urv.cat/~aarenas/data/welcome.htm

4.http://www.correlatesofwar.org/

5.http://www.gmw.rug.nl/~huisman/sna/software.html

6.http://tuvalu.santafe.edu/~aaronc/hierarchy/

7.http://santo.fortunato.googlepages.com/benchmark.tgz

8.The code of the FM algorithm (Clauset et al., 2004)

9.The code of BGLL algorithm (Blondel et al., 2008)

10.The code of Infomap (Rosvall and Bergstrom, 2007) and the Infomap (Rosvall and Bergstrom, 2008a)

algortihm.

11. Memetic-net(Gong et al., 2011)

12.The code of the MODPSO algorithm. (Gong et al., 2014)

12.一些重疊社群的算法代碼。

13.氣候監測資料集

14.黑客資料

15.UCI KDD存檔(各類資料集)

http://kdd.ics.uci.edu/summary.task.type.html

http://kdd.ics.uci.edu/summary.data.type.html

16.UCI收集的機器學習資料集

ftp://pami.sjtu.edu.cn/

http://www.ics.uci.edu/~mlearn//MLRepository.htm

17.樣本資料庫

http://kdd.ics.uci.edu/

WWW頁面被手動分類

http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/

18.CMU全球知識庫(Web-> KB)項目(分類網頁,關系資料描述頁面和超連結)

http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/

19.人工智能機器學習

http://duch-links.wikispaces.com/

8,文本分類,即彩虹的資料集

http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html

20.Statlib數理統計相關程式庫

http://liama.ia.ac.cn/SCILAB/scilabindexgb.htm

http://lib.stat.cmu.edu/

http://lib.stat.cmu.edu/datasets/

http://lib.stat.cmu.edu/modules.php?op=modload&name=Downloads&file=index&req=viewdownload&cid=2

21.癌症基因:

http://www.broad.mit.edu/cgi-bin/cancer/datasets.cgi

22.金融,醫藥資料:

http://lisp.vse.cz/pkdd99/Challenge/chall.html

23.時間序列資料的網址

http://www.stat.wisc.edu/~reinsel/bjr-data/

24.kdnuggets相關連結各種資料集:

http://www.kdnuggets.com/datasets/index.html

25.德國智能分析和資訊系統

http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html

http://dctc.sjtu.edu.cn/adaptive/datasets/

http://fimi.cs.helsinki.fi/data/

26.IBM智能資訊

http://www-958.ibm.com/software/data/cognos/manyeyes/datasets

http://www.almaden.ibm.com/software/quest/Resources/index.shtml

27.頻繁設定計數

http://miles.cnuce.cnr.it/~palmeri/da​​tam/DCI/datasets.php

28.評分資料集

Movielens電影評分資料

http://www.grouplens.org/

Book-Crossing書籍評分資料

http://www.informatik.uni-freiburg.de/~cziegler/BX/

Jester笑話資料集笑話評分集合

http://www.ieor.berkeley.edu/~goldberg/jester-data/

Netflix資料集

29.GPS軌迹資料

GeoLife GPS軌迹

http://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/default.aspx

GPS軌迹與運輸模式标簽

http://research.microsoft.com/apps/pubs/?id=141896

Movebank動物軌迹

http://www.movebank.org/

30.手機WIFI藍牙

達特茅斯存檔無線資料的社群資源

http://crawdad.cs.dartmouth.edu/

crowflow 手機和wifi的軌迹

http://crowdflow.net/

31.OpenStreetMap資料

planet.openstreetmap.org或者http://metro.teczno.com/

32.openpath上傳資料+ API

https://openpaths.cc/

33.FOURSQUARE

34.GeoTime

http://www.geotime.com/GeoTime(s)/January-2012/Cupid-Strikes-Again–Time-Series—GIS–Together-a.aspx

35.資料堂

http://www.datatang.com/

http://www.kdnuggets.com/datasets/

HTTP://appsrv.cse.cuhk.edu.hk/~kdd/data_collection.html

36.進行文本分類&WEB

http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html

http://www.w3.org/TR/WD-logfile-960221.html

http://www.w3.org/Daemon/User/Config/Logging.html#AccessLog

http://www.w3.org/ 1998/11/05 / WC-workshop / Papers / bala2.html

http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/

http:/ /www.web-caching.com/traces-logs.html

http://www-2.cs.cmu.edu/webkb

http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications /TR-75.pdf

http://www.cs.cornell.edu/projects/kddcup/index.html

37.先驗的算法測試資料

http://www.almaden.ibm.com/cs/quest/syndata.html

38.資料生成器的連結

http://www.cse.cuhk.edu.hk/~kdd/data_collection.html

http://www.almaden.ibm.com/cs/quest/syndata.html

39.THE MNIST DATABASE of handwritten digits

http://yann.lecun.com/exdb/mnist/

40.面部圖像資料集

http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html #Large-scale CelebFaces Attributes (CelebA) Dataset

http://vintage.winklerbros.net/facescrub.htm

http://vis-www.cs.umass.edu/lfw/

http://megaface.cs.washington.edu/

41.生物特征資料集

CASIA WebFace資料庫

http://www.cbsr.ia.ac.cn/english/CASIA-WebFace-Database.html

42.很全資料集資源網址為:

http 😕/kdd.ics.uci.edu/

43.路透資料集

http://www.research.att.com/~lewis/reuters21578.html

44.關于基金的資料挖掘的網站

http://www.gotofund.com/index.asp

http://lans.ece.utexas.edu/~strehl/

http://www-2.cs.cmu.edu/webkb

http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-75.pdf

關聯:

http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar

http://www.phys.uni.torun.pl/~duch/software.html

WEKA:

http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar

一個包含37個分類問題的jarfile,最初是從UCI資料庫獲得的

http://prdownloads.sourceforge.net/weka/datasets-UCI.jar

一個包含37個回歸問題的jarfile,從各種來源獲得

http://prdownloads.sourceforge.net/weka/datasets-numeric.jar

一個包含Luis Torgo收集的30個回歸資料集的jarfile

http://prdownloads.sourceforge.net/weka/regression-datasets.jar

http://challenge.ai.iqiyi.com/detail?raceId=5def69ace9fcf68aef76a75d

微信“圖像處理與模式識别研究所”關注我呦

繼續閱讀