Commit Graph

15 Commits

Author SHA1 Message Date
11b 4a1784f8a1 feat: add options to control text generation on the UI 2023-01-04 09:06:17 -03:00
11b aebd405bbd feat: proper checkpoint resume in CLM fine-tune script 2022-12-27 13:21:20 -03:00
11b e99277ec52 feat: log LR in CLM fine-tune script 2022-12-27 13:21:00 -03:00
11b 3e798f6767 fix: rename folder so import actually works 2022-12-26 20:44:35 -03:00
11b 93e283daee feat: implement utility to convert ColossalAI checkpoints to HF pre-trained model 2022-12-26 20:43:01 -03:00
11b b79ac657a4 fix: haru's sft being incompatible with the ColossalAI fine-tune script 2022-12-26 20:42:48 -03:00
11b bcbf0910b4 feat: add supervised fine-tuning code based on haru's work (Warning: Absolutely atrocious code quality. I did just the bare minimum to make it run.) 2022-12-26 17:31:00 -03:00
11b 4f794489ac feat: add support for fine-tuning GPT-NeoX-based models, save optimizer and LR scheduler to checkpoint 2022-12-25 15:42:59 -03:00
11b 186df60691 feat: update inference code for pythia/cai data-based models 2022-12-25 15:39:28 -03:00
11b 5b26097905 feat: implement Gradio UI for proper model inference (WIP) 2022-12-24 12:12:55 -03:00
11b 30ff3751de feat: allow fine-tuning of non-OPT LMs 2022-12-18 22:24:43 -03:00
11b 25ae9da046 fix: tensorboard getting messed up after epoch end 2022-12-18 22:24:26 -03:00
11b 877b1bc56b feat: barebones inference script for testing checkpoints 2022-12-18 17:29:15 -03:00
11b b0d2d80ac3 feat: minor improvements to the fine-tune script 2022-12-18 17:29:15 -03:00
11b 925f5767ec feat: fine-tuning scripts and instructions 2022-12-18 17:29:15 -03:00