🌟 今日前沿论文 · 2025年6月13日

精选科技前沿资讯，洞察科技研究趋势

🔥🔥🔥🔥🔥 Text-to-SQL

Schema-R1：Text-to-SQL任务中基于推理训练的Schema Linking方法

#SchemaLinking #Text-to-SQL #强化学习 #数据库

论文信息

原始标题：Schema-R1: A reasoning training approach for schema linking in Text-to-SQL Task

作者：Wuzhenghong Wen, Su Pan, yuwei Sun

主题：Artificial Intelligence, Computation and Language, Databases

摘要

英文摘要

Schema linking is a critical step in Text-to-SQL task, aiming to accurately predict the table names and column names required for the SQL query based on the given question. However, current fine-tuning approaches for schema linking models employ a rote-learning paradigm, excessively optimizing for ground truth schema linking outcomes while compromising reasoning ability. This limitation arises because of the difficulty in acquiring a high-quality reasoning sample for downstream tasks. To address this, we propose Schema-R1, a reasoning schema linking model trained using reinforcement learning. Specifically, Schema-R1 consists of three key steps: constructing small batches of high-quality reasoning samples, supervised fine-tuning for cold-start initialization, and rule-based reinforcement learning training. The final results demonstrate that our method effectively enhances the reasoning ability of the schema linking model, achieving a 10\% improvement in filter accuracy compared to the existing method. Our code is available at https://github.com/hongWin/Schema-R1/.

中文摘要

Schema linking是Text-to-SQL任务中的关键步骤，旨在根据给定问题准确预测SQL查询所需的表名和列名。然而，当前schema linking模型的微调方法采用死记硬背的学习范式，过度优化于真实schema linking结果而损害了推理能力。这一限制源于难以获取高质量的下游任务推理样本。为解决这个问题，我们提出了Schema-R1，一个使用强化学习训练的推理schema linking模型。具体而言，Schema-R1包含三个关键步骤：构建小批量高质量推理样本、监督微调用于冷启动初始化，以及基于规则的强化学习训练。最终结果表明，我们的方法有效增强了schema linking模型的推理能力，与现有方法相比实现了10%的过滤准确率提升。我们的代码可在https://github.com/hongWin/Schema-R1/获取。

关键问答

Q: 这篇论文试图解决什么问题？

这篇论文试图解决的问题是当前Text-to-SQL任务中schema linking模型的局限性。具体来说：

死记硬背的学习范式：现有的schema linking模型微调方法过度关注于准确匹配真实schema linking结果，而忽视了模型的推理能力。
推理能力不足：由于难以获取高质量的下游任务推理样本，现有方法在训练过程中无法有效培养模型的推理能力。
性能瓶颈：这种缺乏推理能力的模型在实际应用中可能面临性能瓶颈，特别是在处理复杂查询或未见过的schema时。

论文旨在通过提出一种新的训练方法Schema-R1，来解决这些问题并提升schema linking模型的推理能力和整体性能。

Q: 论文提出的解决方案是什么？

论文提出了Schema-R1，一个基于强化学习的推理schema linking模型，其解决方案包含三个关键步骤：

1. 构建高质量推理样本

精心构建小批量高质量推理样本，这些样本能够有效训练模型的推理能力。
样本设计考虑了Text-to-SQL任务中schema linking的各种复杂情况。

2. 监督微调用于冷启动

使用监督学习对模型进行初步训练，为后续强化学习提供良好的初始参数。
这一步骤确保模型具备基本的schema linking能力。

3. 基于规则的强化学习训练

设计专门的奖励函数，基于规则评估模型的schema linking决策。
通过强化学习优化模型参数，使其在保持准确性的同时增强推理能力。
训练过程鼓励模型学习更通用的schema linking策略，而非简单的模式匹配。

这种三步训练方法使Schema-R1能够有效平衡准确性和推理能力，在复杂场景下表现更优。

Q: 论文取得了什么成果？

论文报告了以下主要成果：

性能提升：Schema-R1在filter accuracy指标上比现有方法提高了10%，显著提升了schema linking的质量。
推理能力增强：实验结果表明，Schema-R1能够更好地处理需要复杂推理的schema linking场景。
泛化能力：模型在面对未见过的schema或复杂查询时表现出更好的泛化能力。
开源实现：论文作者开源了Schema-R1的实现代码，便于研究社区复现和改进。

这些成果表明，通过引入强化学习和专门设计的训练方法，可以有效提升Text-to-SQL系统中schema linking组件的性能。

阅读论文全文查看GitHub代码

📱 科技改变生活，资讯连接未来

更新时间：2025年6月13日 20:30 | 数据来源：权威论文站点

更多

注：数据公开发布，版权出版方所有，不构成任何投资建议

返回