Open Shared Data

2017-01-10 | Source: original content of this site


#Important# Please cite the relevant papers when using the data!

[1]math2015.rar

1. There are three directories in the current folder, and each directory contains one dataset used in our paper, as follows:

FrcSub-----------------The public dataset, widely used in cognitive modelling (e.g., [Tatsuoka, 1984; Junker and Sijtsma, 2001; DeCarlo, 2010]), is made up of test responses (right or wrong, coded as 1 or 0) of examinees on Fraction-Subtraction problems.

Math1&Math2------------The two private datasets we used contain the results of two final math examinations at a high school (the score of each examinee on each problem).

2. There are four files in each directory as follows:

data.txt---------------The responses or normalized scores (scaled to the range [0,1] by dividing by the full score of each problem) of each examinee on each problem. A row denotes an examinee and a column denotes a problem.

qnames.txt-------------The detailed name or meaning of each skill.

q.txt------------------The indicator matrix of the relationship between problems and skills, provided by experienced education experts. A row represents a problem and a column represents a skill. E.g., problem i requires skill k if entry(i, k) equals 1, and does not require it otherwise.

problemdesc.txt--------The description of each problem, including the problem type (objective or subjective) and the full score of each problem (set to 1 for all problems in the FrcSub dataset).
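The relationship between data.txt and q.txt described above can be illustrated with a minimal sketch. It assumes the matrices are stored as plain numeric rows separated by whitespace or commas, which is not stated in this description, so check the files (and Example.txt) before relying on it. The helper names parse_matrix and skills_for_problem are hypothetical, and the sample values are made up.

```python
def parse_matrix(text):
    """Parse numeric rows into a list of lists.

    Assumption: values are separated by whitespace or commas.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        rows.append([float(tok) for tok in line.replace(",", " ").split()])
    return rows


def skills_for_problem(q, i):
    """Return the 0-based indices k of the skills that problem i requires."""
    return [k for k, flag in enumerate(q[i]) if flag == 1]


# Made-up sample: 2 examinees x 3 problems, and 3 problems x 2 skills.
data = parse_matrix("1 0 1\n0 0 1\n")
q = parse_matrix("1 0\n1 1\n0 1\n")

# Every column of data.txt (a problem) should match one row of q.txt.
if len(data[0]) != len(q):
    raise ValueError("data.txt columns and q.txt rows disagree")

print(skills_for_problem(q, 1))
```

To try it on a real file, something like parse_matrix(open("FrcSub/q.txt").read()) may work, provided the delimiter assumption holds.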

3. In addition, there is one more file in each of the Math1 and Math2 directories.

rawdata.txt------------The raw unnormalized scores of the Math1 and Math2 datasets.
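The link between rawdata.txt and data.txt (each raw score divided by the full score of its problem) can be sketched as follows. The function name normalize_scores and the sample numbers are hypothetical, and how the full scores are read from problemdesc.txt is left out because its layout is not described here.

```python
def normalize_scores(raw, full_scores):
    """Scale raw scores to [0, 1] by dividing by each problem's full score.

    raw: one row per examinee, one column per problem.
    full_scores: one full score per problem (column).
    """
    if any(len(row) != len(full_scores) for row in raw):
        raise ValueError("each row needs exactly one score per problem")
    return [
        [score / full for score, full in zip(row, full_scores)]
        for row in raw
    ]


# Made-up sample: 2 examinees, 3 problems worth 4, 3 and 12 points.
raw = [[4.0, 0.0, 6.0], [2.0, 3.0, 12.0]]
full_scores = [4.0, 3.0, 12.0]
normalized = normalize_scores(raw, full_scores)
print(normalized)
```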

4. For better understanding, we give two examples of how to use the datasets in the file "Example.txt" in the current folder.

5. If you intend to use the two private datasets (called the Math dataset) for any exploratory analysis, please refer to the Terms of Use, which are described in detail in the file "TermsOfUse.txt".

Please include the following reference in your publication:

Runze Wu, Qi Liu, Yuping Liu, Enhong Chen, Yu Su, Zhigang Chen and Guoping Hu. "Cognitive Modelling for Predicting Examinee Performance." In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence. AAAI Press, 2015.

[2]Data.rar

This data set is extracted from DBLP (http://dblp.uni-trier.de/xml/). The input files are:

All\Author.txt --> Author/researcher information

All\AimedPaper.txt --> Selected papers

All\paperCite.txt --> Paper citation information

All\Editor Board\.. --> Editorial/organizing committee information

Each file in PageRankTop10 stores the IDs of the Top-10 researchers chosen by the PageRank algorithm in the corresponding domain (corresponding to the table "An illustration of each domain’s Top-10 researchers mined by the methods with different priors." in the IJCAI13 paper).

Similarly, the files in PageRankTop50 store the IDs of the Top-50 researchers chosen by the PageRank algorithm.
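For readers unfamiliar with how a Top-k list can be obtained from a ranking, here is a minimal sketch of plain PageRank by power iteration followed by a Top-k selection. It is not the procedure used to produce the PageRankTop10 and PageRankTop50 files: the uniform prior, the damping factor of 0.85, the edge-list input and the names pagerank and top_k are all assumptions made for illustration, and the toy graph is made up.

```python
def pagerank(edges, damping=0.85, iterations=100):
    """Plain PageRank with a uniform prior, computed by power iteration.

    edges: list of (source, target) pairs, e.g. "source cites target".
    Returns a dict mapping each node to its score (scores sum to 1).
    """
    nodes = sorted({n for edge in edges for n in edge})
    out_links = {n: [] for n in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n_nodes = len(nodes)
    rank = {n: 1.0 / n_nodes for n in nodes}
    for _ in range(iterations):
        # Mass held by nodes without outgoing links is spread uniformly.
        dangling = sum(rank[n] for n in nodes if not out_links[n])
        base = (1.0 - damping) / n_nodes + damping * dangling / n_nodes
        new_rank = {n: base for n in nodes}
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        rank = new_rank
    return rank


def top_k(rank, k):
    """Return the k highest-scoring node IDs, best first."""
    return sorted(rank, key=rank.get, reverse=True)[:k]


# Made-up toy graph: a and b both point to c, and c points back to a.
edges = [("a", "c"), ("b", "c"), ("c", "a")]
scores = pagerank(edges)
print(top_k(scores, 2))
```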

Please include the following reference in your publication:

Qi Liu, Biao Xiang, Nicholas Jing Yuan, Enhong Chen, Hui Xiong, Yi Zheng, Yu Yang, An Influence Propagation View of PageRank, ACM Trans. KDD.

[3]Editor Board.rar

This data set consists of editorial/organizing committee data for some conferences.

Please include the following reference in your publication:

Qi Liu, Biao Xiang, Nicholas Jing Yuan, Enhong Chen, Hui Xiong, Yi Zheng, Yu Yang, An Influence Propagation View of PageRank, ACM Trans. KDD.
