模型能力评估基准是指用于系统衡量人工智能模型 […]
SuperGLUE基准(SuperGLUE […]
Winograd Schema Challe […]
指令遵循(Instruction Follo […]
GLUE基准(General Languag […]
常识推理(Common Sense Reas […]
符号推理(Symbolic Reasonin […]
世界知识(World Knowledge)在 […]
推理能力(Reasoning Ability […]
涌现能力(Emergent Abilitie […]