You can access the datasets (Docusign + Stack Overflow) for best-answer prediction in legacy forums from here.

Use of the dataset is freely permitted, but citation of the original work is required. Please add the following reference to your bibliography:

F. Calefato, F. Lanubile, N. Novielli. (2016) “Moving to Stack Overflow: Best Answer Prediction in Legacy Developer Forums”, In Proc. 10th Int’l Symposium on Empirical Softw. Eng. and Measurement (ESEM’16), Ciudad Real, Spain, Sept. 8-9, 2016, doi:10.1145/2961111.2962585.


@inproceedings{calefato2016moving,
  author = {Calefato, Fabio and Lanubile, Filippo and Novielli, Nicole},
  title = {{Moving to Stack Overflow: Best Answer Prediction in Legacy Developer Forums}},
  booktitle = {Proc. of the 2016 {IEEE}/{ACM} Int'l Symposium on Empirical Software Engineering and Measurement},
  location = {Ciudad Real, Spain},
  articleno = {13},
  numpages = {10},
  pages = {13:1--13:10},
  series = {ESEM '16},
  doi = {10.1145/2961111.2962585},
  url = {},
  year = {2016},
  publisher = {{ACM}},
}