docs: add basic roadmap

11b 2022-12-20 17:40:48 -03:00
parent 30ff3751de
commit 45f495a4dd
1 changed file with 32 additions and 0 deletions

ROADMAP.md Normal file

@@ -0,0 +1,32 @@
# Project Roadmap
## An intro for dummies
If you're not familiar with all the theory/technology behind the project, here's a super simplified rundown:
- There are "text generation AIs" that are freely available for researchers. These are called open-source LMs (language models).
- Modern chatbots are usually made by taking a language model and "fine-tuning" it, which basically just means feeding it data similar to what you want it to generate (there's a small code sketch of this right after this list).
- In our case, this means fine-tuning it with conversation and roleplay data (research usually calls this "dialogue data", and they call models fine-tuned on dialogue data "dialogue models").
- LMs can have different "sizes". For example, Meta's OPT language model is offered in 125m, 350m, 1.3B, 2.7B, 6.7B, 30B and 66B sizes (where "m" = million and "B" = billion parameters).
- The bigger the model, the better its quality, but also the more hardware you need to fine-tune and use it. And when I say more, I don't mean a couple more gigabytes of system RAM; I mean going from a single 6GB GPU to hundreds of 80GB GPUs.
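As promised above, here's a minimal sketch of what that fine-tuning step typically looks like with the HuggingFace `transformers` library. Everything here (the `dialogue.txt` file name, the hyperparameters, the output path) is a made-up placeholder, not our actual setup:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

# Load a small pre-trained LM and its tokenizer.
model_name = "facebook/opt-350m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# "dialogue.txt" is a placeholder: one conversation example per line.
dataset = load_dataset("text", data_files={"train": "dialogue.txt"})
tokenized = dataset["train"].map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True,
    remove_columns=["text"],
)

# Standard causal-LM fine-tuning: the model learns to continue the
# dialogue data it's fed, and checkpoints get saved along the way.
trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="checkpoints",
        per_device_train_batch_size=4,
        num_train_epochs=1,
        save_steps=500,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```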
So, knowing the above, our main "top-level"/medium-term objective at the moment is to get as much good-quality data as we can and fine-tune the biggest model we can. From there, we can play around with the models and see what the results are like, then debate and decide how to move forward.
---
For anyone who's interested in the actual details, here's a TL;DR version of the project's current roadmap at the task level:
## Current Status
- We have all the tooling to build a dataset from various sources, fine-tune a pre-trained LM on that dataset, and then run inference on checkpoints saved during the fine-tuning process (there's a rough sketch of the inference step right after this list).
- All of that tooling can be found within this repository.
- We have taken a small model, Meta's OPT-350m, and fine-tuned it on a small dataset we've built with the tooling described above.
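To give a rough idea of that last step, running inference on a saved checkpoint looks something like the sketch below. The checkpoint path and the prompt format are hypothetical placeholders, not the repository's actual tooling:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# "checkpoints/checkpoint-500" is a placeholder path from a fine-tuning run.
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
model = AutoModelForCausalLM.from_pretrained("checkpoints/checkpoint-500")

prompt = "You: Hi there! How are you today?\nBot:"
inputs = tokenizer(prompt, return_tensors="pt")

# Sample a continuation instead of decoding greedily, so replies vary.
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```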
## Next Steps
- We will release our tiny prototype model (which we're calling `pygmalion-350m`) to the public.
- This is _not_ to show off results (since they suck), but rather just to show we _are_ actually working on the project and we're capable of creating something.
- The model will likely be hosted on HuggingFace, and have a simple Gradio UI for inference (a bare-bones sketch of what that might look like is at the end of this roadmap).
- We will release a browser userscript which allows CharacterAI users to dump their (anonymized) chat histories. We will then allow anyone to anonymously send us their dumps to be used as training data for the model.
- Once we have enough data, we will move on to trying to fine-tune OPT-1.3B.
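For the curious, a bare-bones Gradio demo along those lines might look something like this. The model name, prompt handling, and generation settings are all placeholder assumptions, not the final demo:

```python
import gradio as gr
from transformers import pipeline

# Placeholder model; the real demo would load the released checkpoint.
generator = pipeline("text-generation", model="facebook/opt-350m")

def chat(prompt: str) -> str:
    # The pipeline output contains the prompt followed by the model's continuation.
    result = generator(prompt, max_new_tokens=64, do_sample=True)
    return result[0]["generated_text"]

demo = gr.Interface(fn=chat, inputs="text", outputs="text", title="pygmalion-350m demo")
demo.launch()
```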