from Hacker News

Ask HN: How to Parse and Process BofA Statements?

by surds on 8/18/21, 9:14 AM with 4 comments

I have previously seen some discussions on HN where folks have shared open-source or personally-developed solutions to parse reports and statements from various financial institutions.

I need to parse and visualize statements over last few years and see the trends of spending. The statements are available only as PDF now. Is there any way to do this?

Appreciate all pointers!

  • by cafard on 8/21/21, 7:50 PM

    I have once or twice had luck with the Python Camelot package, which you read about at https://camelot-py.readthedocs.io/en/master/ .

    But one can burn quantities of time trying to extract useful information from PDFs, with small results. I wish you luck.

  • by mackatsol on 8/19/21, 5:19 PM

    I’ve heard of tools in Python that can extract data and text from PDF files…

    My bank offers CSV downloads of the same data. Look for that first! :)