from Hacker News

JFK Assassination Records Dataset on Hugging Face

by farhanhubble on 4/9/25, 7:20 AM with 3 comments

  • by farhanhubble on 4/9/25, 7:20 AM

    I am releasing JFK-TELL, a dataset I generated by extracting text from the scanned PDFs of the assassination records released through April 2025. The extraction used the Google Gemini LLM API to generate Markdown text from a very simple prompt. For the detailed methodology, check out the GitHub repo at https://github.com/farhanhubble/jfk-tell
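
    For readers wondering what a PDF-to-Markdown pass like this might look like, here is a minimal sketch. It is not the code from the repo. The function name `extract_to_markdown`, the `transcribe` callable, and the prompt text are all hypothetical; `transcribe` stands in for whatever wrapper sends the PDF bytes and prompt to the LLM API and returns Markdown.

```python
from pathlib import Path
from typing import Callable, Iterable

# Hypothetical prompt; the post only says the real one was "very simple".
PROMPT = "Transcribe this scanned document into Markdown."


def extract_to_markdown(
    pdf_paths: Iterable[Path],
    out_dir: Path,
    transcribe: Callable[[bytes, str], str],
) -> list[Path]:
    """Write one .md file per PDF and return the paths written.

    `transcribe` is an assumed stand-in for the LLM call: it receives the
    raw PDF bytes and the prompt, and returns Markdown text.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for pdf in sorted(pdf_paths):
        target = out_dir / (pdf.stem + ".md")
        if target.exists():
            continue  # already processed on an earlier run, so skip it
        markdown = transcribe(pdf.read_bytes(), PROMPT)
        target.write_text(markdown, encoding="utf-8")
        written.append(target)
    return written
```

    Passing the model call in as a function keeps the file handling testable without network access, and skipping existing outputs means an interrupted run over thousands of scans can be resumed without paying for the same pages twice.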