Phi-2 Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Phi-2 Model checkpoints for the paper Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Identifier
Source https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4269
Metadata Access https://tudatalib.ulb.tu-darmstadt.de/oai/openairedata?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:tudatalib.ulb.tu-darmstadt.de:tudatalib/4269
Provenance
Creator Puerto, Haritz; Chubakov, Tilek; Zhu, Xiaodan; Tayyar Madabushi, Harish; Gurevych, Iryna
Publisher TU Darmstadt
Contributor TU Darmstadt
Publication Year 2024
Rights CC BY-SA 3.0; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://tudatalib.ulb.tu-darmstadt.de/page/contact
Representation
Language English
Resource Type Other
Format application/zip
Version 1.0
Discipline Other