IBM watsonx Code Assistant model details
watsonx Code Assistant
IBM watsonx Code Assistant is designed to accelerate the software development lifecycle and is built on IBM Granite code models.
The public data sources used to train the models include:
General Programming
- GitHub
- StarCoder
- CommitPack
- Glaive-code-assistant
Java
- methods2test
- Jakarta EE source code and documentation
- MicroProfile source code and documentation
Other (Such as natural language and math)
- MathInstruct
- OpenWebMath
- S2ORC: The Semantic Scholar Open Research Corpus
- RedPajama 1T
- Stack Exchange
- Wikimedia