from Hacker News

Show HN: Data Extraction with Flexible Schemas

by sails on 12/16/24, 12:42 PM with 0 comments

Hey!

We’ve built a data extraction tool to flexibly automate data and document processing.

You’ve probably seen a few of these, so have we! A few of us have been varyingly stuck trying to automate the extraction of borrower financials for the past 5 years.

We think that there are a few missing features of most data extraction tools.

* They are usually too complex to quickly get up and running

* They are overly constrained in terms of what workflows and documents they support

We’ve always felt like speed and flexibility were sticking points, so we went slightly orthogonal to the alternatives.

It is deliberately very simple, with three key features

1. Generate a custom schema that is the target of the data extraction

2. Logic and “expert insights” can get added to field level prompts

3. Ability to share with externals via chat to get data and feedback, quickly!

The demo page is at https://go.sea.dev with an explainer and link to demo.

The demo app is constrained, but feel free to get in touch with me with details if you’d like to unlock the full capabilities

Some of the features we are working on:

* Improvements on OCR

* Advanced data management

* Email or WhatsApp to accept responses

* API and embedded data collection copilot for SaaS

* Meeting recording plugin, and streaming audio

* Securely share partial submissions with 3rd party for completion

* Enrichment via website scraping

* Integrations (Hubspot, Sheets etc)

We are a very small team, with a mix of researchers and engineers. We are leaning into improvements in data extraction, conversation evals and data structuring innovation to make the product as powerful and seamless as possible.

Our background is in financial services, so most of our effort is going into that use-case (business risk assessment to be specific), we’ve solved a bunch of our internal use cases as well as general purpose customer problems, so we thought to share it with a wider audience.

Would love to get your feedback and comments!