基于知识库检索反馈的期货市场智能合规方法研究
Research on the Intelligent Compliance Methods in the Futures Market Based on Knowledge Base Retrieval Feedback
随着人工智能在高合规性领域的深入应用,如何在保障模型表达能够满足严格法规约束,成为大语言模型面临的核心挑战。针对现有方法存在的标注成本高、难以适应动态监管场景及偏好与规则冲突等问题,本文提出一种基于合规知识库反馈的强化学习微调框架(RLKBF,Reinforcement Learning from Knowledge Base Feedback)。该方法结合语义增强的向量表示与层次化检索机制,构建结构化法规知识库。同时,引入合规偏离惩罚项的双月标优化策略,以协调用户意图与法规规范的平衡。实验结果显示,RLKBF 在回答准确率、合规性稳定性及专家评估指标上均优于主流对比模型,显著提升了模型对专业法规知识的整合与应用能力。
As artificial intelligence continues to be applied in highly regulated domains,ensuring that large language models comply with strict legal constraints has become a central challenge.Existing approaches often face limitations such as high annotation costs, poor adaptability to evolving regulatory environments,and conflicts between user preferences and compliance requirements.To address these issues,this paper proposes a fine-tuning framework based on Reinforcement Learning from Knowledge Base Feedback(RLK- BF).This method constructs a structured legal knowledge base using semantically enhanced vector representations and hierarchical re- trieval mechanisms.Additionally,a dual-objective optimization strategy is introduced,incorporating a compliance deviation penalty to balance user intent with regulatory constraints.Experimental results demonstrate that RLKBF outperforms mainstream models in terms of response accuracy,compliance stability,and expert evaluation metrics,significantly improving the model's ability to integrate and ap- ply domain-specific legal knowledge.
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
国家自然科学基金项目(71971066)
/
| 〈 |
|
〉 |