Sitemap
A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.
Pages
Posts
Bringing Order to Uncertainty: Designing and Implementing an LLM API Gateway
Published:
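An LLM API gateway is not as simple as "wrapping a few model APIs in one layer." For an AI-native product, it is closer to an execution control plane: it establishes order among MaaS (Model-as-a-Service), object storage, task state machines, content safety, cost accounting, and compliance auditing. Starting from the experience of building an in-house LLMAPI module for an AI comic platform, this post discusses why a generic aggregation layer is not enough, how media generation tasks amplify the value of a gateway, and what a low-barrier LLM gateway for small teams in China might look like in the future 🤔.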
Press On, Do Not Waver: There Is Light at the End of the Darkness
Published:
My first blog post: ramblings after a year of being tormented by the graduate entrance exam 🥱
portfolio
2024 Album
Second semester of sophomore year and first semester of junior year
2025 Album
Second semester of junior year and first semester of senior year
publications
QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL
Published in arXiv, 2024
Fine-tuning LLMs for Text-to-SQL tasks is effective, but the resulting models often struggle with multi-turn queries due to ambiguity. QDA-SQL is a data augmentation method that uses LLMs to generate diverse multi-turn Q&A pairs. Fine-tuning with QDA-SQL improves SQL statement accuracy and enhances the models' ability to manage challenging, unanswerable questions. The generation script and test set are released at GitHub.
Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types
Published in International Joint Conference on Neural Networks (IJCNN), 2025
Developed MMSQL, a test suite that evaluates how well LLMs manage different question types and multi-turn interactions. Additionally, created a multi-agent system to better identify question types and select appropriate strategies. Experiments show that this approach enhances the models' ability to navigate conversational complexities. For a more detailed presentation, refer to the Page. IEEE arXiv
AdaCOS: adaptive differential privacy shuffle model based on cosine similarity
Published in Journal of King Saud University Computer and Information Sciences, 2026
AdaCOS makes federated learning training adaptive. Instead of treating every device the same, it continually evaluates each device's updates. For devices contributing valuable, on-target information, it increases their communication allowance while reducing protective noise.
talks
Conference proceedings talk on the MMSQL test suite
Published:
Presentation of the paper “Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types” Bilibili
teaching
Teaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.