AI Benchmark Tool

News

Programmer builds open-source AI scoring tool to measure how "stupid" large models are

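Programmer ionutvi has released AI Benchmark Tool, an open-source tool that quantifies the "stupidity" of AI models such as ChatGPT and Grok, using 140 tasks to test accuracy, stability...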

Artificial intelligence, AI Benchmark Tool, AI models, ChatGPT | September 18
