Term: Fall 2016
- Team #2
- Team members
- team member 1 Duanhong Gao
- team member 2 Yi Jian
- team member 3 Jingwei Li
- team member 4 Aoyuan Liao
- team member 5 Hanqing Shi
- team member 6 Jia Wang
- team member 7 Xiangyu Wu
- team member 8 Yutong Yang
- team member 9 Wanyi Zhang
- Project summary: In this project, we are going to answer questions below
- What types of questions/answers are more popular (w.r.t. Scores) (How to ask/give valuable questions/answers); Predict how long it will take to answer questions
- Aoyuan Liao, Hanqing Shi, Yi Jian
- R & Python question difference (hotness, topics, disadvantages, ...)
- Duanhong Gao, Jia Wang, Xiangyu Wu
- Question tag recommendation engine
- Jingwei Li, Wanyi Zhang, Yutong Yang
- What types of questions/answers are more popular (w.r.t. Scores) (How to ask/give valuable questions/answers); Predict how long it will take to answer questions
- Data source:
- R questions: https://www.kaggle.com/stackoverflow/rquestions
- Python questions: https://www.kaggle.com/datasets?sortBy=hottest&group=featured&search=stack
- 10% questions on programming topics: https://www.kaggle.com/stackoverflow/stacksample
- Each dataset is organized as three tables: Questions.csv, Answers.csv, and Tags.csv
Contribution statement: (default) All team members contributed equally in all stages of this project. All team members approve our work presented in this GitHub repository including this contributions statement.
Following suggestions by RICH FITZJOHN (@richfitz). This folder is orgarnized as follows.
proj/
├── data/
├── doc/
├── figs/
├── lib/
├── meetings/
└── output/
Please see each subfolder for a README file.