Senior ML Graph Engineer | Remote | #1103

  • Position: Senior ML Graph Engineer
  • Location: Remote
  • Contract type: B2B

About Us

Our client is building an intelligent code discovery platform that gives developers the best tools to discover code in any form and be more productive. We are transforming code search to improve the practice of modern programming—taking a graph-based approach using data from the entire open source ecosystem. We’re on a mission to build the world’s best code discovery engine. The company is funded by top investors in Silicon Valley, including the first investors of Google, Twitter, Zoom, LinkedIn, and Uber. Our team has backgrounds from NASA, LinkedIn, Facebook, Amazon, AWS, Cisco and MIT, Harvard, Stanford, and Berkeley. Our company is based in San Francisco, California, but our team is all remote and globally distributed.


We seek an Machine Learning Engineer with expertise to lead technology development on the frontier of code discovery and developer productivity. You will be responsible for setting up a robust & versatile knowledge graph, bringing together data from a variety of sources and leveraging AI to empower other software engineers to be more effective. Our team expands the state of the art in machine learning technology, which enables thoughtful, efficient, and intuitive ways to search, re-use, explore or process metadata around the code. You will use world-class engineering and machine learning techniques on real-world, internal, and external data to directly impact the evolution of software development. A successful applicant is an expert in machine learning, software engineering, and complex data analysis spanning natural language, code syntax, and networks.

Who Will Love This Job

You have built large scale machine learning models and systems. You are passionate about technology and can keep up with the latest developments. You have academic and/or proven experience in knowledge graphs, data management, and natural language processing. You are a skilled technical influencer and have a track record of successfully influencing product direction. You have a respectful, collaborative approach that earns the trust of your peers and stakeholders.


  • Our backend, data fetching pipelines, tooling is built with Go

  • Our frontend is built using TypeScript & Svelte

  • Our ML stack is built using Python & PyTorch

  • Our cloud automation is built using Terraform

  • Our data is primarily stored in PostgreSQL

  • Our search engine is powered by OpenSearch

  • Our services run on Google’s Cloud Platform


  • Design, develop and lead implementation of our knowledge graph

  • Build complete data processing systems that drive products, systems or applications

  • Identify and model various types of data sources to improve & scale knowledge graph

  • Design and implement algorithms to infer connections in the knowledge graph for validation

  • Oversee design, development and deployment of internal and external APIs

  • Apply the latest techniques & academic research to real world problems in a production environment

  • Review code, mentor other engineers and support the data science and engineering teams

  • Build and deploy robust machine learning / deep learning models that improve entity extraction, classification, resolution and disambiguation

  • Collaborate with technical and non-technical business partners to develop analytics and metrics that describe the performance of matching systems and the quality of our data.

  • Determine the feasibility of projects through quick prototyping with respect to performance, quality, time and cost using Agile methodologies

  • Architect best-in-class infrastructure (platforms, tools, and approaches) to accelerate our research in the production phase and to unblock efficient deployment, optimization, and testing of ML models

Minimum Qualifications

  • 8+ years expertise as a backend/data engineer building graphs systems and graph databases

  • 5+ years expertise with machine learning and/or natural language processing

  • Degree in Computer Science, Machine Learning, Data Science or related field, with expertise in knowledge representation

  • Expertise in two or more programming languages (e.g. Python and Go)

  • Expertise working with graph data models (RDF, Property), databases (Amazon Neptune, Neo4J, TigerGraph), or graph query languages (Gremlin, SPARQL, Cypher)

  • Expertise with at least two or more ML/DL framework (e.g. PyTorch, TensorFlow, Keras)

  • Expertise architecting data pipelines and acquiring data needed to build and evaluate models, using tools like Dataflow, Apache Beam, or Spark

Preferred Qualifications

  • PhD in computer science, artificial intelligence, machine learning or related technical field

  • Advanced working knowledge of information retrieval and search technologies and have set up and used open-source search systems to query and understand data

  • Expertise coding in Go

  • Expertise with Graph ML and graph learning problems, solutions & analytics (e.g. graph embedding and graph neural networks)

  • Experience with many of the following technologies:

  • Graph databases

  • Modern ML Models (e.g. BERT, Pytorch Geometric)

  • MLOps and Infrastructure

  • ElasticSearch, Solr and Lucene

  • Kubernetes, Docker, Terraform

  • Machine learning infrastructure

  • CircleCI, GitHub Actions, Jenkins

What We Offer

You have the opportunity to join an early stage startup and have significant ownership of technology development. You will work at the highest level and collaborate with world-class colleagues, advisors and technical experts. Our team is growing rapidly and we hope you’ll grow with us, too.

  • Competitive salary & equity packages

  • Unlimited vacation and sick leave

  • Strong remote work culture and esprit de corps



... i zgarnij do 2500 PLN!