Loading...
This site is best viewed in a modern browser with JavaScript enabled.
Something went wrong while trying to load the full version of this site. Try hard-refreshing this page to fix the error.
OpenCompass - evaluating large language models across tasks and datasets.
Xkyer
https://github.com/open-compass/opencompass