【AI大模型应用开发】RAT原理与实现：又是一个提高大模型生成能力的方法（附完整代码）

显示全部楼层

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;border-left: none;padding: 1em;border-radius: 8px;color: rgba(0, 0, 0, 0.5);background: rgb(247, 247, 247);margin: 0px 8px 2em;">如果你对大模型应用有一定了解，一定听说过下面的几个词儿：COT（思维链）、TOT（思维树）、RAG（检索增强生成）、LATS（Language Agent Tree Search）...

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">大模型相关的领域真的是在高速发展，现在竟然又出现了一个RAT。本文我们来看下RAT是什么，能用来解决什么问题，以及其实现思路和原理。

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;font-size: 1.2em;font-weight: bold;display: table;margin: 2em auto 1em;padding-right: 1em;padding-left: 1em;border-bottom: 2px solid rgb(15, 76, 129);color: rgb(63, 63, 63);">0. 什么是RAT

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">RAT，全称是 Retrieval Augmented Thoughts，检索增强思维。它的提出是想要用来提高RAG（检索增强生成）的能力，改善RAG效果，减少幻觉。

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">乍一听高端，但其实很简单：RAT = RAG + COT。即，在RAG的基础上，增加COT思维链。

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">相关论文：https://arxiv.org/pdf/2403.05313

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">论文中实验的RAT的实现效果：在代码生成、数学推理、具体任务规划和创意写作等各个方面，RAT的效果都优于直接生成、单用COT、单用RAG的效果（如下图中的下半部分）。

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;margin: 1.5em 8px;letter-spacing: 0.1em;color: rgb(63, 63, 63);">上图的上半部分，论文中展示了相同问题在直接生成、单用COT、单用RAG 和 RAT 4种情况下生成的效果。可以更加直观的感受到RAT的优化效果。

ingFang SC", "Hiragino Sans GB", "Microsoft YaHei UI", "Microsoft YaHei", Arial, sans-serif;font-size: 1.2em;font-weight: bold;display: table;margin: 2em auto 1em;padding-right: 1em;padding-left: 1em;border-bottom: 2px solid rgb(15, 76, 129);color: rgb(63, 63, 63);">1. RAT的原理与实现

1.1 RAT的原理

上面已经提到了，RAT的实现原理其实就是 RAG + COT。其实现步骤如下：

（1）生成初始CoT：给定一个任务提示，LLM首先零样本（zero-shot）生成一个初始的、分步的思维链（CoT），其中包含多个推理步骤。

（2）检索相关信息：对于CoT中的每一步，RAT使用当前和之前所有的思维步骤，以及原始任务提示，生成一个查询（query）。然后，这个查询被用来从外部知识库中检索相关信息。

（3）修正思维步骤：利用检索到的信息，LLM逐步修正CoT中的每一步。这意味着，每一步的修正都基于当前步骤和之前所有已修正步骤的信息。

（4）生成最终回答：修正完所有的思维步骤后，LLM会根据这些修正后的步骤生成最终的输出。

伪代码实现如下：

1.2 RAT的实现

上面有了RAT的原理和基本步骤，可能还是有点懵。别急，下面我们看下RAT的具体实现代码。

1.2.1 实现代码及解析

完整实现代码：https://github.com/CraftJarvis/RAT

下面是RAT的基本流程部分：

#RATFunction
newline_char='\n'

defrat(question):
print(f"{datetime.now()}[INFO]获取草稿...")
draft=get_draft(question)
print(draft)
print(f"{datetime.now()}[INFO]处理草稿...")
draft_paragraphs=split_draft(draft)

answer=""
fori,pinenumerate(draft_paragraphs):
print(str(i)*80)
print(f"{datetime.now()}[INFO]修改第{i+1}/{len(draft_paragraphs)}部分...")
answer=answer+'\n\n'+p

print(f"{datetime.now()}[INFO]生成对应Query...")
res=run_with_timeout(get_query_wrapper,3,question,answer)
ifnotres:
print(f"{datetime.now()}[INFO]跳过后续步骤...")
continue
else:
query=res
print(f">>>{i}/{len(draft_paragraphs)}Query:{query.replace(newline_char,'')}")

print(f"{datetime.now()}[INFO]获取网页内容...")
#content=get_content(query)
res=run_with_timeout(get_content_wrapper,5,query)
ifnotres:
print(f"{datetime.now()}[INFO]跳过后续步骤...")
continue
else:
content=res

forj,cinenumerate(content):
ifj>2:
break
print(f"{datetime.now()}[INFO]根据网页内容修改对应答案...[{j}/{min(len(content),3)}]")
#answer=get_revise_answer(question,answer,c)
res=run_with_timeout(get_revise_answer_wrapper,10,question,answer,c)
ifnotres:
print(f"{datetime.now()}[INFO]跳过后续步骤...")
continue
else:
diff_html=generate_diff_html(answer,res)
display(HTML(diff_html))
answer=res
print(f"{datetime.now()}[INFO]答案修改完成[{j}/{min(len(content),3)}]")
returndraft,answer

对以上代码的解释：

（1）生成初始CoT：draft = get_draft(question)

（2）然后切分出CoT的步骤（论文中的T1、T2、... 、Tn）：draft_paragraphs = split_draft(draft)

（3）生成对应Query：res = run_with_timeout(get_query_wrapper, 3, question, answer)

defget_query(question,answer):
query_prompt='''
Iwanttoverifythecontentcorrectnessofthegivenquestion,especiallythelastsentences.
Pleasesummarizethecontentwiththecorrespondingquestion.
ThissummarizationwillbeusedasaquerytosearchwithBingsearchengine.
ThequeryshouldbeshortbutneedtobespecifictopromiseBingcanfindrelatedknowledgeorpages.
Youcanalsousesearchsyntaxtomakethequeryshortandclearenoughforthesearchenginetofindrelevantlanguagedata.
Trytomakethequeryasrelevantaspossibletothelastfewsentencesinthecontent.
**IMPORTANT**
Justoutputthequerydirectly.DONOTaddadditionalexplanationsorintroducementintheanswerunlessyouareaskedto.
'''
query=openai_client.chat.completions.create(
model="gpt-3.5-turbo",
messages=[
{
"role":"system",
"content":chatgpt_system_prompt
},
{
"role":"user",
"content":f"##Question:{question}\n\n##Content:{answer}\n\n##Instruction:{query_prompt}"
}
],
temperature=1.0
).choices[0].message.content
returnquery

这里值得注意的一点，也是我觉得比较核心的一点：answer = answer + '\n\n' + p，在生成Query的时候，将之前的答案和对应的段落拼接在一起，然后一起总结。

（4）根据生成的Query检索内容：res = run_with_timeout(get_content_wrapper, 5, query)

defget_content(query):
res=get_search(query,1)
ifnotres:
print(">>>NogoodGoogleSearchResultwasfound")
returnNone
search_results=res[0]
link=search_results['link']#title,snippet
res=get_page_content(link)
ifnotres:
print(f">>>Nocontentwasfoundin{link}")
returnNone
retrieved_text=res
trunked_texts=chunk_texts(retrieved_text,1500)
trunked_texts=[trunked_text.replace('\n',"")fortrunked_textintrunked_texts]
returntrunked_texts

（5）根据检索内容修正答案：res = run_with_timeout(get_revise_answer_wrapper, 10, question, answer, c)

defget_revise_answer(question,answer,content):
revise_prompt='''
IwanttorevisetheansweraccordingtoretrievedrelatedtextofthequestioninWIKIpages.
Youneedtocheckwhethertheansweriscorrect.
Ifyoufindsomeerrorsintheanswer,revisetheanswertomakeitbetter.
Ifyoufindsomenecessarydetailsareignored,addittomaketheanswermoreplausibleaccordingtotherelatedtext.
Ifyoufindtheanswerisrightanddonotneedtoaddmoredetails,justoutputtheoriginalanswerdirectly.
**IMPORTANT**
Trytokeepthestructure(multipleparagraphswithitssubtitles)intherevisedanswerandmakeitmorestructualforunderstanding.
Splittheparagraphswith`\n\n`characters.
Justoutputtherevisedanswerdirectly.DONOTaddadditionalexplanationsorannoucementintherevisedanswerunlessyouareaskedto.
'''
revised_answer=openai_client.chat.completions.create(
model="gpt-3.5-turbo",
messages=[
{
"role":"system",
"content":chatgpt_system_prompt
},
{
"role":"user",
"content":f"##ExistingTextinWikiWeb:{content}\n\n##Question:{question}\n\n##Answer:{answer}\n\n##Instruction:{revise_prompt}"
}
],
temperature=1.0
).choices[0].message.content
returnrevised_answer