Search jobs > San Francisco, CA > Data engineer

Data Engineer (Spark/Python)

Fintool
San Francisco, California, US
Full-time

About

Fintool is an AI Equity Research Copilot for institutional investors. It’s a LLM on top of financial documents, starting with SEC filings.

Fintool is engineered to discover financial insights beyond the reach of timely human analysis. We are building the AI Warren Buffett.

Below covers everything you need to know about what this opportunity entails, as well as what is expected from applicants.

We are on the fastest growing LLM vertical applications. Thousands of investors signed up for Fintool. Fintool is backed by Y Combinator and entrepreneurs such as the co-founders of Datadog, Vercel, HuggingFace, or domain experts from OpenAI to Deepmind.

Team

Nicolas Bustamante : spent 7 years building one of the largest AI-driven legal search engines (Bloomberg for lawyers). Nicolas hired nearly 200 people, secured millions of dollars in debt and equity funding, and the profitable business was successfully acquired by Summit Partners, a $43B billion growth equity fund, for $x00M+.

Edouard Godfrey : worked for 9 years at Apple, leading teams of data scientists and engineers. He worked on Apple Search (Spotlight) and Apple Pay, maintaining big data pipelines and deploying cutting-edge AI models.

He received the 2019 Apple Pay Innovation Award for outstanding contributions and fresh insights.

Our Philosophy

Small team : small in-person teams outperform large and well-funded companies. When people visit our office, they should be surprised by how few people we are.

Ship code : we avoid meetings and PM jargon to release early, release often, and listen to customers.

In-person : we believe high-performing teams do their best work, build long-term relationships, and have the most fun in person.

Company Values

Clone and improve the best : we're not about reinventing the wheel but about enhancing proven success. We are shameless cloners who stand on the shoulders of giants.

We draw inspiration and then create differentiation because distinctiveness drives dominance.

Release early, release often, and listen to your customers : speed matters in business, so we push better-than-perfect updates for customers ASAP.

Mastery comes from repeated experiments and learning from mistakes rather than putting in a set number of hours. It’s 10,000 iterations, not 10,000 hours.

Warren Buffe We model our personal and professional ethos on the principles he exemplifies. Upholding integrity, valuing honesty, practicing frugality, championing lifelong learning, embracing humility, extending generosity, applying rationality, and demonstrating patience.

Every day, we strive to mirror these Buffett-inspired virtues.

Job Description

We are building real-time data pipelines for millions of unstructured financial documents to feed our financial LLM. You will build real-time data pipelines to process millions of financial docs, and build ML-based parsers to chunk and tag intelligently to index into our Elastic that will then feed our LLM.

It’s cutting-edge data engineering at the AI frontier.

Requirements : Spark / Databricks, Python, Postgres, and LLM. Knowing Elastic, Next.js, TypeScript, and web crawling is a plus.

Experience : 3+ years of deploying production code at a company with a large infrastructure.

Location : San Francisco (no remote).

Contract : Full-time.

Apply via the form below.

J-18808-Ljbffr

6 days ago
Related jobs
Promoted
Fintool
San Francisco, California

Spark/Databricks, Python, Postgres, and LLM. Edouard Godfrey: worked for 9 years at Apple, leading teams of data scientists and engineers. It’s cutting-edge data engineering at the AI frontier. Fintool is engineered to discover financial insights beyond the reach of timely human analysis. ...

Promoted
SynergisticIT
Oakland, California

Currently, we are looking for entry-level software programmers, Java Full stack developers, Python/Java developers, Data analysts/ Data Scientists, Data Engineers, Machine Learning engineers for full time positions with clients. We want Data Science/Machine learning/Data Analyst and Java Full stac...

SGA
San Francisco, California

We are seeking an enthusiastic and inquisitive Data Engineer who is passionate about data and proficient in programming with Python. Python Data Engineer (Content Operations). Python coding: Write clean, efficient, and maintainable code as standalone scripts in Python to implement data solutions and...

E-Solutions
California, United States

Job Title: Python Data Engineer Location: Remote Duration: Long Term JD: Must Have: Python,FHIR,JSON,HL7 “Disclaimer: E-Solutions Inc. ...

Intelliswift Software Inc
San Francisco, California

We are seeking an enthusiastic and inquisitive Data Engineer who is passionate about data and proficient in programming with Python. Python Data Engineer - Content Operations. Python Data Engineer - Content Operations. Python coding: Write clean, efficient, and maintainable code as standalone script...

E-Solutions
California, United States

We need Python developers who are heavy on the data engineering side, and developers who have used Python for complex data scenarios such as data migrations, matching, merging and collation. Role: Data Engineer with Python, FHIR & JSON. Developers should have used Python to build and manage data pip...

Promoted
Metropolitan Transportation Commission (MTC)
San Francisco, California

The Program Coordinator/Analyst, Transit and Asset Management Data role will be filled at the Assistant/Associate level and is under the general supervision of a Principal Program Coordinator. The Program Coordinator for Transit and Asset Management Data, as a member of the Funding Policy and Progra...

Promoted
University of California-Berkeley
Berkeley, California

Working skills in data preprocessing/cleaning, statistical analysis, systems programming, database design and data security measures, especially regarding large datasets. Develops database tool for tracking research data and assists in transmission and presentation of data. Provides technical assist...

Promoted
VirtualVocations
Oakland, California

A company is looking for a Pentaho Developer with extensive experience in Pentaho Data Integration (PDI). ...

Promoted
LHH
CA, United States

You will assist with all development stages for the SQL databases, write SQL queries, and conduct SQL database troubleshooting. The SQL Developer will also be responsible for the implementation, configuration, maintenance, and performance of SQL Server to ensure the availability and consistent perfo...