I recently bought a 3090 Ti, and I think it's time to train an AI that can auto-reply to newbie-level questions, as well as write some basic macros and widget-calling wikitext.
QA AI
I need some publicly accessible data to build the dataset, but crawling this forum is less efficient than a direct download. @boris, it seems you host the forum server? Would it be OK to send me a zip or SQLite dump of the public conversation data in this forum (excluding user info and private DMs)?
Would people here mind this?
For the Google forum, I may have to write a crawler.
If you have other datasets to suggest, please reply. For example, a wiki that is full of tutorials? I can't just crawl https://links.tiddlywiki.org/, since some of the linked sites may contain personal info unrelated to TiddlyWiki.
Code AI
I think I will download TiddlyWiki plugins from GitHub. Then I need to add comments to the plugins (with the help of the QA AI), so that the trained AI can generate more reasonable wikitext using chain-of-thought (output the reasoning in a comment, then the code).
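To make the "reason in a comment, then code" idea concrete, here is a rough Python sketch of how one training example could be laid out. Everything in it is an assumption for illustration: the `make_example` and `to_jsonl` helpers, the `prompt`/`completion` field names, and the JSONL output are hypothetical and not tied to any particular training framework. The reasoning is written as an HTML-style `<!-- ... -->` comment, which wikitext accepts.

```python
import json


def make_example(prompt: str, reason: str, wikitext: str) -> dict:
    """Build one prompt/completion pair with the reasoning placed first, as a comment.

    Hypothetical helper: the field names are an assumption, not a fixed format.
    """
    # A literal "--" inside an HTML-style comment can confuse some parsers,
    # so break it up before embedding the reasoning.
    safe_reason = reason.strip().replace("--", "- -")
    completion = f"<!-- {safe_reason} -->\n{wikitext.strip()}\n"
    return {"prompt": prompt.strip(), "completion": completion}


def to_jsonl(examples: list) -> str:
    """Serialize examples as JSON Lines, one example per line."""
    return "\n".join(json.dumps(e, ensure_ascii=False) for e in examples)


example = make_example(
    prompt="List all tiddlers tagged 'Task' as links.",
    reason="Use the list widget with a tag filter and link to each result.",
    wikitext=(
        '<$list filter="[tag[Task]]">'
        "<$link to=<<currentTiddler>>><$text text=<<currentTiddler>>/></$link><br/>"
        "</$list>"
    ),
)
print(example["completion"])
print(to_jsonl([example]))
```

Putting the comment before the code means the model has to produce its reasoning before it produces any wikitext, which is the ordering chain-of-thought relies on. The comments generated by the QA AI would still need manual spot checks before being used as training data.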