Some resources about Big Code and Naturalness can be found at

A list of datasets used in this area can be found in the appendix of the survey and at


A few university courses are being taught that cover aspects of machine learning for code, big code, or naturalness of code. Below are a few that have publicly available material.

Please feel free to submit a pull request to add more links to this page.

Source{d} has collected a set of links and papers in the area. You can access the list here.