近端策略优化(Proximal Policy […]
奖励模型(Reward Model)是强化学 […]
提示工程(Prompt Engineerin […]
一样本学习(One-shot Learnin […]
少样本学习(Few-shot Learnin […]
零样本学习(Zero-shot Learni […]
上下文学习(In-context Learn […]
检索增强生成(Retrieval-Augme […]
向量数据库(Vector Database) […]
嵌入(Embedding)是人工智能领域中的 […]