Crosby is an AI-first legal platform reimagining corporate legal services from the ground up. We are a team of technologists and legal experts who build proprietary technology and human-in-the-loop workflows that meaningfully improve how lawyers and machines work together — delivering speed, consistency, and quality across high-stakes work.
Our systems review complex documents faster and with exceptional accuracy, combining advanced AI with structured legal expertise. Clients receive AI-powered redlines, commentary, and negotiation guidance within hours, at a predictable, volume-based price.
Backed by Sequoia, Index Ventures, and Bain Capital Ventures, we're building the end-to-end contracting platform for the next generation of fast-growing companies.
The Engineering team builds the core systems and infrastructure that power Crosby’s AI-first legal platform. We work at the intersection of machine learning, product, and legal expertise to deliver intelligent, reliable systems that operate in high-stakes environments.
We prioritize speed, ownership, and high standards — shipping quickly while maintaining rigor in everything we build.
As a Data Scientist at Crosby, you'll play a critical role in developing the models and data systems that power our AI-driven legal platform. You’ll work across the full lifecycle — from data definition and labeling strategy to model development, evaluation, and iteration in production.
You’ll partner closely with engineers, product teams, and legal experts to translate complex legal workflows into structured machine learning systems. Beyond modeling, you’ll help ensure our systems are accurate, reliable, and continuously improving in real-world use.
Develop evaluation systems: Build metrics, benchmarks, and experimentation frameworks to measure and improve model performance.
Drive data strategy: Partner with legal and product teams to define labeling schemas, curate high-quality datasets, and improve data pipelines.
Support production systems: Work closely with engineering to deploy models, monitor performance, and iterate based on real-world usage.
Apply AI pragmatically: Leverage LLMs and other modern techniques to solve product problems, balancing sophistication with reliability and speed.
Collaborate cross-functionally: Partner with engineering, product, and legal teams to deliver end-to-end systems that improve customer outcomes.
1–5 years of experience in data science, machine learning, or applied NLP, ideally in a fast-paced startup environment
Strong foundation in machine learning, statistics, and data analysis, with proficiency in Python
Hands-on experience working with LLMs, NLP systems, or unstructured text data
Experience working across the full ML lifecycle — from data curation and experimentation to deployment and monitoring
Highly analytical with strong problem-solving skills and the ability to translate ambiguous problems into structured solutions
Strong communicator who can collaborate effectively with both technical and non-technical stakeholders
Crosby is an equal opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the New York City Fair Chance Act.
Pursuant to New York Labor Law Section 194-b, the US Pay Range for this position is listed below. Final compensation will be determined based on skills, experience, and qualifications.
Compensation Range: $200k–$250k
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Join Habitat Energy's Austin team as an MLOps Engineer to productionize and scale mission-critical forecasting and optimization models that power energy trading and analytics.
NBCUniversal seeks part-time Academic Year Data Science & Analytics interns to support decision sciences and sports content analytics at its Stamford, CT hub.
Thrad seeks a hands-on Data Scientist to drive product and commercial decisions through rigorous causal analysis, experimentation, and analytics at its San Francisco HQ.
Apply state-of-the-art AI to financial workflows at Rowspace by building retrieval systems, agentic pipelines, and evaluation frameworks that turn unstructured data into actionable investment insights.
Pravah seeks an ML Summer Intern to work on cutting-edge forecasting, computer vision, and graph ML problems that support real-world electric grid deployments.
Visa is hiring a Staff Applied Scientist II to lead AI-driven data science and engineering workstreams on the Acceptance Platform, building scalable, secure ML solutions for global payments.
NBCUniversal is seeking an Advertising & Partnerships Data Science Academic Year intern to apply analytics, modeling, and AI/ML to real-world ad tech problems across its advertising product ecosystem.
Lead the measurement, experimentation, and data architecture for HingeSelect as the first dedicated Staff Product Data Scientist driving causal analysis, funnel optimization, and supply-demand modeling.
Lead Bugcrowd’s AI & Data Science team to design and operationalize scalable ML systems and generative AI solutions that power next-generation preemptive cybersecurity products.
Thrad seeks an Applied Scientist to design, train, and productionize real-time contextual ad relevance and bidding models for LLM conversations at its San Francisco HQ.
Superhuman is hiring Data Scientists in San Francisco to lead experimentation, causal inference, and ML-driven analyses that influence product and growth decisions.
Lead EDETEK’s AI strategy and engineering to build scalable, compliant AI solutions that accelerate clinical development and product innovation for biopharma clients.
Lead the development of an AI Agent platform and production ML systems at Superhuman, driving orchestration, LLM integration, and proactive product experiences.