IBM’s Project CodeNet wants to teach AI how to code

IBM has announced Project CodeNet, a large dataset that aims to help teach AI how to understand and even write code.

Project CodeNet was announced at IBM’s Think conference this week, and IBM claims it is the largest open-source dataset for code, approximately 10 times the size of the next largest.

CodeNet contains 500 million lines of code and 14 million examples, and spans 55 programming languages, including Python, C++, Java, Go, COBOL, and Pascal.

Projects such...