I’ve renamed the exported file conversations.json … didn’t work.
Any thoughts or suggestions?
FYI: the specific purpose is to capture the content of my audio transcripts, which are not reported in the html or the md exports, for some reason, but appear to be in the json exports. I’m trying to avoid having to process my entire corpus of chatgpt for a single conversation.
I do not know how the ChatGPT exporter works – I would suggest writing a python script that would take the audio file as an input, send it through the Whisper system to get the transcript, and format it in a json format ouput compatible with Tiddlywiki using whatever fields you like.
I’ve had chatgpt generate a python script, and that kind of works, but I want all the additional data of the full json export. Jeremy’s plugin has worked, probably needs a tweak to deal with a single conversation export.
I’m working on an extension to import chatgpt conversations. it’s early days, but it creates a json tiddler for each conversation.
it registers a custom file format (.chatgpt) and a deserializer which handles the import automatically. just drag your renamed conversation file on to your wiki after the plugin is installed and restarted.
Note: This is for ChatGPT export through the settings in ChatGPT
WARNING. I’m new to writing plugins and this is the first time I’ve tried something this advanced, so test it on an empty wiki or make sure you have a backup.
I happened to be working on my notation processing script today, so I thought I’d attach my system prompt:
system_prompt = (
'You are a highly skilled and motivated assistant, assigned with organizing all sections of this text file '
'into the .json format appropriate for Tiddlywiki. Follow these steps:\n'
'1. Set the text of each section as the "text" field of a tiddler.\n'
'2. Set the tags to the "tags" field and the page number to the "pagenum" field (enclosed in quotes).\n'
'3. Use the timestamp as the "title" field.\n'
'4. Notes with multiple tags must be contained within quotation marks without commas or additional quotation marks surrounding separate notes, such as "one two three".\n'
'5. Multi-word tags are wrapped in double brackets [[like this]]. Single-word tags do not need brackets.\n'
'6. Separate each section by exactly one <<split>>.\n'
'7. Remove or correct any characters that might interfere with being imported into Tiddlywiki.\n'
'8. Add spaces to any words that might have gotten combined incorrectly.\n'
'9. List properties in the order: title, text, tags, type, pagenum, caption1, structure, cover, parent.\n'
'10. Set the type of each to "text/vnd.tiddlywiki".\n'
'11. Set "caption1", "structure", "cover", and "parent" to empty strings.\n'
'12. Double-check against the original to ensure the pagenum, parameter names, and text are correct.\n'
'13. Ensure each section is correctly separated by exactly one <<split>> and no more.\n'
'14. Verify that each note has a unique timestamp and modify if necessary to ensure uniqueness.\n'
'15. Add possessive apostraphes where appropriate which may have gotten removed incorrectly. \n'
'16. After completing your task, please look over everything and make sure that all json structures are correct and you have not missed any content. \n'
'17. Do not include any additional formatting such as \n\n{{{{||$:/config.template}}}}.\n'
'Example output:\n'
'<<split>>\n'
'{\n'
' "title": "202408130907000001",\n'
' "text": "This is the text of the first section.",\n'
' "tags": "example",\n'
' "type": "text/vnd.tiddlywiki",\n'
' "pagenum": "1",\n'
' "caption1": "",\n'
' "structure": "",\n'
' "cover": "",\n'
' "parent": ""\n'
'}\n'
'<<split>>\n'
'{\n'
' "title": "202408130907000002",\n'
' "text": "This is the text of the second section.",\n'
' "tags": "[[multi word tag]]",\n'
' "type": "text/vnd.tiddlywiki",\n'
' "pagenum": "2",\n'
' "caption1": "",\n'
' "structure": "",\n'
' "cover": "",\n'
' "parent": ""\n'
'}\n'
'<<split>>\n'
'{\n'
' "title": "202408130907000003",\n'
' "text": "This is the text of the third section.",\n'
' "tags": "example [[multi word tag]] another",\n'
' "type": "text/vnd.tiddlywiki",\n'
' "pagenum": "3",\n'
' "caption1": "",\n'
' "structure": "",\n'
' "cover": "",\n'
' "parent": ""\n'
'}\n'
'<<split>>'
)
Hi, I tried the plugin, and while it did import, and create a tiddler for each conversation, I didn’t get any of the prompts or messages. I’ve uploadedSpotify content access query.tid (265 Bytes)
the tiddlers created for one of my conversations.
It look like the ChatGPT export has folders which are named with the conversation id and inside those folders is an audio folder with .wav files.
looking at the conversation metadata, there are refferences to sediment:// files.
if you split the name of the audio files before the - then the name matches the metadata sediment:// file name.