Phi-2 Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Provenance | |
---|---|
Creator | Puerto, Haritz; Chubakov, Tilek; Zhu, Xiaodan; Tayyar Madabushi, Harish; Gurevych, Iryna |
Publisher | TU Darmstadt |
Contributor | TU Darmstadt |
Publication Year | 2024 |
Rights | CC BY-SA 3.0; info:eu-repo/semantics/openAccess |
OpenAccess | true |
Contact | https://tudatalib.ulb.tu-darmstadt.de/page/contact |
Representation | |
---|---|
Language | English |
Resource Type | Other |
Format | application/zip |
Version | 1.0 |
Discipline | Other |