Posted at: 30 October

Senior Software Engineer, Vector Indexing

Company

LanceDB

LanceDB is an open-source, serverless vector database designed for fast, scalable billion-scale vector search, primarily serving the AI and machine learning industry with a B2B focus.

Remote Hiring Policy:

LanceDB is a remote-first team that supports flexible work hours, hiring from various regions to accommodate a diverse workforce.

Job Type

Full-time

Allowed Applicant Locations

Worldwide

Salary

$180,000 to $235,000 per year

Job Description

About LanceDB

LanceDB is a developer-friendly, open-source data lake for multimodal AI. From hyper-scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large-scale AI datasets, LanceDB is the best foundation for your AI application, and powers some of the most groundbreaking applications and challenging requirements today.

About the Role

We’re looking for a Software Engineer focused on Vector Indexing to help build the next generation of vector-native data infrastructure. You’ll work on high-performance indexing and search systems at the core of LanceDB, enabling scalable similarity search, full-text search, and flexible indexing for the open-source and enterprise communities alike.

You’ll be responsible for:

  • Designing, building, and maintaining core vector indexing and search components

  • Implementing GPU-accelerated indexing algorithms and performance optimizations

  • Maintaining and evolving vector index algorithms, including pruning, quantization, and graph-based methods

  • Developing and optimizing full-text search capabilities and integrations

  • Benchmarking, profiling, and tuning performance across varied workloads

  • Writing and maintaining documentation, benchmarks, tutorials, and blog posts to support and grow adoption

  • Engaging with the open-source community: reviewing contributions, triaging issues, and joining design discussions

Requirements

  • Strong proficiency in Rust

  • Experience designing or implementing vector search or indexing algorithms (e.g., HNSW, IVF, PQ, quantization, clustering)

  • Proficiency in C for GPU-related development

  • Familiarity with GPU acceleration frameworks (CUDA, ROCm, etc.)

  • Demonstrated ability to benchmark, profile, and optimize system performance

  • Excellent written communication and documentation skills

  • Comfortable collaborating in open-source environments

Nice to Have

  • Understanding of full-text search systems (Lucene, Elasticsearch, Tantivy, etc.)

  • Experience building or maintaining data systems, databases, or search engines

  • Familiarity with distributed systems and scale-out architecture

  • Background in web APIs, embedding serving, or real-time systems

  • Contributions to or maintenance of open-source projects

What We Offer

  • A key role shaping an open-source project with real production usage

  • Remote-first team with flexible hours

  • Competitive compensation, equity, and benefits

  • Generous learning budget and support for open-source contributions

About the LanceDB Team


LanceDB was created by experts with decades of experience building tools for data science and machine learning. From co-authors of pandas to Apache PMC members of HDFS, Arrow, and Delta, the LanceDB team has created open-source tools used by millions worldwide.

Compensation Range: $180K - $235K