Modeling Copilots for
Text-to-Model Translation

LLM-based copilots and datasets for translating natural language descriptions into formal optimization and constraint programming models.

Serdar Kadıoğlu1,2  ·  Karthik Uppuluri1  ·  Akash Singirikonda2

1 AI Center of Excellence, Fidelity Investments 2 Brown University

Abstract

There is growing interest in leveraging large language models (LLMs) for text-to-model translation and optimization tasks. This paper aims to advance this line of research by introducing Text2Model and Text2Zinc. Text2Model is a suite of copilots based on several LLM strategies with varying complexity, along with an online leaderboard. Text2Zinc is a cross-domain dataset for capturing optimization and satisfaction problems specified in natural language, along with an interactive editor with built-in AI assistant. While there is an emerging literature on using LLMs for translating combinatorial problems into formal models, our work is the first attempt to integrate both satisfaction and optimization problems within a unified architecture and dataset. Moreover, our approach is solver-agnostic unlike existing work that focuses on translation to a solver-specific model. To achieve this, we leverage MiniZinc's solver-and-paradigm-agnostic modeling capabilities to formulate combinatorial problems.

Publications

Papers and their venues. Click a title to jump to its full resource card below.

Paper Index
  1. Modeling Copilots for Text-to-Model Translation · INFORMS'26 · arXiv:2604.12955
  2. Fine-tuned MiniZinc Models (Learn2Zinc) · arXiv'26
  3. Text2Zinc: A Cross-Domain Dataset for Modeling in MiniZinc · arXiv'25 · arXiv:2503.10642
  4. Gala: Global LLM Agents for Text-to-Model Translation · arXiv'25 · arXiv:2509.08970
  5. Ner4Opt: Named Entity Recognition for Optimization Modelling · Constraints'24 · CPAIOR'23 · NeurIPS'22

Resources

Our collective body of work on translating natural language optimization problem descriptions into formal optimization and constraint programming models.

Copilot Suite

LLM Copilots

A comprehensive suite of LLM-based modeling copilots for text-to-model translation. Includes multiple strategies of varying complexity — from zero-shot and chain-of-thought prompting to compositional and agentic workflows — along with an online leaderboard for benchmarking and an interactive MiniZinc editor with a built-in AI assistant.

Serdar Kadıoğlu, Karthik Uppuluri, Akash Singirikonda

@misc{kadioglu2026text2model,
  title         = {Modeling Copilots for Text-to-Model Translation},
  author        = {Serdar Kadioglu and Karthik Uppuluri and Akash Singirikonda},
  year          = {2026},
  eprint        = {2604.12955},
  archivePrefix = {arXiv},
  primaryClass  = {cs.AI},
  url           = {https://arxiv.org/abs/2604.12955}
}
Fine-tuned Models

Fine-tuned MiniZinc Models

Fine-tuning state-of-the-art open-source LLMs to generate MiniZinc constraint models directly from natural language problem descriptions. Includes training datasets with plain and chain-of-thought variants, augmented data, and models spanning multiple architectures and sizes — from 0.6B to 20B parameters.

Serdar Kadıoğlu, Karthik Uppuluri, Akash Singirikonda

@misc{kadioglu2026text2model,
  title         = {Modeling Copilots for Text-to-Model Translation},
  author        = {Serdar Kadioglu and Karthik Uppuluri and Akash Singirikonda},
  year          = {2026},
  eprint        = {2604.12955},
  archivePrefix = {arXiv},
  primaryClass  = {cs.AI},
  url           = {https://arxiv.org/abs/2604.12955}
}
Dataset

Text2Zinc

A cross-domain dataset capturing optimization and satisfaction problems specified in natural language text. Leverages MiniZinc's solver-agnostic modeling capabilities to formulate a diverse range of combinatorial problems, bridging natural language specifications with formal constraint models. The first dataset to integrate both satisfaction and optimization problems within a unified framework.

Akash Singirikonda, Serdar Kadıoğlu, Karthik Uppuluri

@misc{singirikonda2025text2zinc,
  title         = {Text2Zinc: A Cross-Domain Dataset for Modeling Optimization and Satisfaction Problems in MiniZinc},
  author        = {Akash Singirikonda and Serdar Kadioglu and Karthik Uppuluri},
  year          = {2025},
  eprint        = {2503.10642},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CL},
  url           = {https://arxiv.org/abs/2503.10642}
}
Multi-Agent

Global LLM Agents (Gala)

Global LLM Agents for text-to-model translation. Introduces collaborative multi-agent workflows where specialized LLM agents work together to decompose, formulate, and validate optimization models from natural language — achieving stronger performance through agent collaboration.

Junyang Cai, Serdar Kadıoğlu, Bistra Dilkina

@misc{cai2025gala,
  title         = {Gala: Global LLM Agents for Text-to-Model Translation},
  author        = {Junyang Cai and Serdar Kadioglu and Bistra Dilkina},
  year          = {2025},
  eprint        = {2509.08970},
  archivePrefix = {arXiv},
  primaryClass  = {cs.AI},
  url           = {https://arxiv.org/abs/2509.08970}
}
NER / NLP

Ner4Opt

Named entity recognition for optimization modeling from natural language. Extracts optimization-related entities — objectives, decision variables, constraints, and parameters — from free-form problem descriptions. Combines classical NLP techniques with fine-tuned transformer architectures and optimization-specific data augmentation.

Serdar Kadıoğlu, Parag Pravin Dakle, Karthik Uppuluri, Regina Politi, Preethi Raghavan, SaiKrishna Rallabandi, Ravisutha Srinivasamurthy

@article{Kadioglu24,
  author    = {Serdar Kadioglu and Parag Pravin Dakle and Karthik Uppuluri and Regina Politi and Preethi Raghavan and SaiKrishna Rallabandi and Ravisutha Srinivasamurthy},
  title     = {Ner4Opt: named entity recognition for optimization modelling from natural language},
  journal   = {Constraints},
  volume    = {29},
  number    = {3-4},
  pages     = {261--299},
  year      = {2024},
  url       = {https://doi.org/10.1007/s10601-024-09376-5},
  doi       = {10.1007/S10601-024-09376-5}
}
@inproceedings{dakle23,
  title     = {Ner4Opt: Named Entity Recognition for Optimization Modelling from Natural Language},
  author    = {Parag Pravin Dakle and Serdar Kadıoğlu and Karthik Uppuluri and Regina Politi and Preethi Raghavan and SaiKrishna Rallabandi and Ravisutha Srinivasamurthy},
  booktitle = {The 20th International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research (CPAIOR 2023)},
  year      = {2023}
}
@inproceedings{ner4opt2022neurips,
  title     = {Ner4Opt: Named Entity Recognition for Optimization Modelling from Natural Language},
  author    = {Parag Pravin Dakle and Serdar Kadıoğlu and Karthik Uppuluri and Regina Politi and Preethi Raghavan and SaiKrishna Rallabandi and Ravisutha Srinivasamurthy},
  booktitle = {NeurIPS 2022 Workshop on Optimization for Machine Learning (OPT)},
  year      = {2022}
}

Events

Tutorials, workshops, and competitions on LLMs for optimization and constraint programming.

LLMs for Modeling Tutorial IJCAI'26 LLM-Solve Workshop CP'26 LLM-Solve Workshop CP'25 LLMs for CP/OR Master Class CPAIOR'26 LLMs for Optimization Tutorial AAAI'26 LLMs for OR Workshop AAAI'24 NL4Opt Competition NeurIPS'22