toolbox/waifu
TearGosling ea162de2e0 feat: add SODA dataset
* Very first prototype of SODA dataset support

I'm also bringing over the version of PromptConstants from the dev branch due to needing CHAT_START_TOKEN

* More flexibility when fetching speaker names

* Make SODA a PDM instead of a VDM

* Swap order of speakers based on relation

* Oh, and fix a typo too

* Bugfix
2023-01-08 15:48:52 -03:00
..
core feat: alternative way of handling and augmenting episode data (wip) 2023-01-04 09:05:51 -03:00
datasets feat: add SODA dataset 2023-01-08 15:48:52 -03:00
modules feat: add SODA dataset 2023-01-08 15:48:52 -03:00
scripts feat: alternative way of handling and augmenting episode data (wip) 2023-01-04 09:05:51 -03:00
utils feat: changes to log and discard some not-so-great data 2023-01-01 11:34:31 -03:00
__init__.py fixup! feat: initial commit 2022-12-18 17:29:15 -03:00