1.http://snap.stanford.edu/index.html
2.http://www-personal.umich.edu/~mejn/
3.http://deim.urv.cat/~aarenas/data/welcome.htm
4.http://www.correlatesofwar.org/
5.http://www.gmw.rug.nl/~huisman/sna/software.html
6.http://tuvalu.santafe.edu/~aaronc/hierarchy/
7.http://santo.fortunato.googlepages.com/benchmark.tgz
8.The code of the FM algorithm (Clauset et al., 2004)
9.The code of the BGLL algorithm (Blondel et al., 2008)
10.The code of Infomap (Rosvall and Bergstrom, 2007) and the Infomap (Rosvall and Bergstrom, 2008a) algorithm.
11.Memetic-net (Gong et al., 2011)
12.The code of the MODPSO algorithm (Gong et al., 2014)
12.Code for some overlapping community detection algorithms.
13.Climate monitoring datasets
14.Hacker data
15.UCI KDD Archive (datasets of various kinds)
http://kdd.ics.uci.edu/summary.task.type.html
http://kdd.ics.uci.edu/summary.data.type.html
16.Machine learning datasets collected by UCI
ftp://pami.sjtu.edu.cn/
http://www.ics.uci.edu/~mlearn//MLRepository.htm
17.Sample databases
http://kdd.ics.uci.edu/
Manually classified WWW pages
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/
18.CMU World Wide Knowledge Base (Web->KB) project (classified web pages, with relational data describing pages and hyperlinks)
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
19.Artificial intelligence and machine learning
http://duch-links.wikispaces.com/
Text classification (datasets for the Rainbow toolkit)
http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html
20.StatLib: libraries related to mathematical statistics
http://liama.ia.ac.cn/SCILAB/scilabindexgb.htm
http://lib.stat.cmu.edu/
http://lib.stat.cmu.edu/datasets/
http://lib.stat.cmu.edu/modules.php?op=modload&name=Downloads&file=index&req=viewdownload&cid=2
21.Cancer genes:
http://www.broad.mit.edu/cgi-bin/cancer/datasets.cgi
22.Financial and medical data:
http://lisp.vse.cz/pkdd99/Challenge/chall.html
23.Time series data sites
http://www.stat.wisc.edu/~reinsel/bjr-data/
24.KDnuggets links to various datasets:
http://www.kdnuggets.com/datasets/index.html
25.German intelligent analysis and information systems
http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html
http://dctc.sjtu.edu.cn/adaptive/datasets/
http://fimi.cs.helsinki.fi/data/
26.IBM intelligent information
http://www-958.ibm.com/software/data/cognos/manyeyes/datasets
http://www.almaden.ibm.com/software/quest/Resources/index.shtml
27.Frequent itemset counting
http://miles.cnuce.cnr.it/~palmeri/datam/DCI/datasets.php
28.Rating datasets
MovieLens movie rating data
http://www.grouplens.org/
Book-Crossing book rating data
http://www.informatik.uni-freiburg.de/~cziegler/BX/
Jester joke dataset (a collection of joke ratings)
http://www.ieor.berkeley.edu/~goldberg/jester-data/
Netflix dataset
29.GPS trajectory data
GeoLife GPS trajectories
http://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/default.aspx
GPS trajectories with transportation mode labels
http://research.microsoft.com/apps/pubs/?id=141896
Movebank animal trajectories
http://www.movebank.org/
30.Mobile phone, Wi-Fi, and Bluetooth
CRAWDAD: a community resource for archiving wireless data at Dartmouth
http://crawdad.cs.dartmouth.edu/
crowdflow: mobile phone and Wi-Fi trajectories
http://crowdflow.net/
31.OpenStreetMap data
planet.openstreetmap.org or http://metro.teczno.com/
32.OpenPaths: uploaded data plus an API
https://openpaths.cc/
33.FOURSQUARE
34.GeoTime
http://www.geotime.com/GeoTime(s)/January-2012/Cupid-Strikes-Again–Time-Series—GIS–Together-a.aspx
35.Datatang
http://www.datatang.com/
http://www.kdnuggets.com/datasets/
http://appsrv.cse.cuhk.edu.hk/~kdd/data_collection.html
36.Text classification and the Web
http://www-2.cs.cmu.edu/afs/cs/project/theo-11/www/naive-bayes.html
http://www.w3.org/TR/WD-logfile-960221.html
http://www.w3.org/Daemon/User/Config/Logging.html#AccessLog
http://www.w3.org/1998/11/05/WC-workshop/Papers/bala2.html
http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/
http://www.web-caching.com/traces-logs.html
http://www-2.cs.cmu.edu/webkb
http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-75.pdf
http://www.cs.cornell.edu/projects/kddcup/index.html
37.Test data for the Apriori algorithm
http://www.almaden.ibm.com/cs/quest/syndata.html
38.Links to data generators
http://www.cse.cuhk.edu.hk/~kdd/data_collection.html
http://www.almaden.ibm.com/cs/quest/syndata.html
39.THE MNIST DATABASE of handwritten digits
http://yann.lecun.com/exdb/mnist/
40.Face image datasets
http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html #Large-scale CelebFaces Attributes (CelebA) Dataset
http://vintage.winklerbros.net/facescrub.htm
http://vis-www.cs.umass.edu/lfw/
http://megaface.cs.washington.edu/
41.Biometric datasets
CASIA WebFace database
http://www.cbsr.ia.ac.cn/english/CASIA-WebFace-Database.html
42.A very comprehensive dataset resource:
http://kdd.ics.uci.edu/
43.Reuters dataset
http://www.research.att.com/~lewis/reuters21578.html
44.Websites on data mining for funds
http://www.gotofund.com/index.asp
http://lans.ece.utexas.edu/~strehl/
http://www-2.cs.cmu.edu/webkb
http://www.cs.auc.dk/research/DP/tdb/TimeCenter/TimeCenterPublications/TR-75.pdf
Related:
http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar
http://www.phys.uni.torun.pl/~duch/software.html
WEKA:
http://flow.dl.sourceforge.net/sourceforge/weka/regression-datasets.jar
A jar file containing 37 classification problems, originally obtained from the UCI repository
http://prdownloads.sourceforge.net/weka/datasets-UCI.jar
A jar file containing 37 regression problems, obtained from various sources
http://prdownloads.sourceforge.net/weka/datasets-numeric.jar
A jar file containing 30 regression datasets collected by Luis Torgo
http://prdownloads.sourceforge.net/weka/regression-datasets.jar
http://challenge.ai.iqiyi.com/detail?raceId=5def69ace9fcf68aef76a75d
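
The WEKA downloads listed above are jar files, and a jar file is an ordinary zip archive, so the datasets can be inspected without a Java toolchain. The sketch below lists the dataset files inside such an archive using only the Python standard library. It assumes the datasets are stored as .arff files, and the file name datasets-UCI.jar in the usage part is only an assumed local download name.

```python
import zipfile


def list_arff_entries(jar_path):
    """Return the sorted names of .arff entries inside a jar (zip) archive.

    jar_path may be a filesystem path or a binary file-like object.
    Assumption: the datasets are stored as .arff files in the archive.
    """
    with zipfile.ZipFile(jar_path) as jar:
        return sorted(
            name for name in jar.namelist()
            if name.lower().endswith(".arff")
        )


if __name__ == "__main__":
    # Hypothetical local file name; download one of the jars first.
    for entry in list_arff_entries("datasets-UCI.jar"):
        print(entry)
```

Individual entries can then be pulled out with `zipfile.ZipFile.extract` and read by any ARFF-aware loader.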