IBM Cloud Docs
IBM watsonx Code Assistant model details

IBM watsonx Code Assistant model details

watsonx Code Assistant

IBM watsonx Code Assistant is designed to accelerate the software development lifecycle and is built on IBM Granite code models.

The public data sources used to train the models include:

General Programming

  • GitHub
  • StarCoder
  • CommitPack
  • Glaive-code-assistant

Java

  • methods2test
  • Jakarta EE source code and documentation
  • MicroProfile source code and documentation

Other (Such as natural language and math)

  • MathInstruct
  • OpenWebMath
  • S2ORC: The Semantic Scholar Open Research Corpus
  • RedPajama 1T
  • Stack Exchange
  • Wikimedia