Software Engineer (AI Data Engine, Staff/Senior, Open Source, SaaS)
Build a large-scale AI data engine for unstructured datasets
Build and manage large-scale AI data infrastructure for very large datasets such as LAION 5B. The focus is on distributed data processing and on integrating additional signals such as embeddings.
Why It's Interesting
Work on a large project with real impact on the AI industry
Original description from 4dayweek.io
About Us
At iterative.ai, we build open-source tools for machine learning: DVC (12k+ ⭐ on GitHub), and enterprise-grade data infrastructure solutions. We also offer a team collaboration SaaS solution, Studio. We're a well-funded (Series A), remote-first team (50+ employees) on a mission to solve the complexities of managing datasets, ML infrastructure, the ML model lifecycle, and other ML and data-centric workflows.
We value great collaboration and communication skills, both among internal teams and in how we interact with our users. We take care to balance and be responsive to the needs of our open source community as well as our enterprise customers.
Check us out in other places: 🖥 Website 📂 Docs 👾 GitHub 🖊 Blog ⏯️ YouTube 💬 Discord
Job Description
"... competitive advantage in AI goes not so much to those with data but those with a data engine: iterated data acquisition, re-training, evaluation, deployment, telemetry. And whoever can spin it fastest." - A. Karpathy
We are building the next generation of DVC, DVCx, which will serve as a core infrastructure component for managing large amounts of unstructured data (e.g. at the scale of the LAION 5B dataset). How do you create or improve a dataset in minutes when there are millions or billions of objects in a bucket? How do you add additional signals (e.g. embeddings) at scale to a dataset like LAION 5B?
Join us if you have experience building big-data, distributed data processors (Spark, Ray, etc.), if you have used data infrastructure like that used in self-driving cars, or if you have similar experience, and you want to make these unstructured data management tools available in open source and SaaS.
Responsibilities
- Own large new areas within our data management software, and build them from the ground up
- Participate in the entire product lifecycle from concept through production
- Be able, and willing, to multi-task and learn new technologies quickly
Must Have
- 5+ years of industry experience as a software engineer
- Experience building or working with AI infrastructure at scale (similar to Tesla's data engine, Waymo, etc.) or similar relevant experience
- Solid knowledge of Python
- At least one year of experience with file systems, concurrency, multithreading, and server architectures
- Passionate about building highly reliable system software
Great to Have
- Experience working remotely
- Experience working on high performance database internals, or heavily distributed server backends
- Prior startup experience
- Experience at other API technology companies
- Command of modern system-level languages like Go or Rust
ℹ️ Our Hiring Process
We will go over the process with you in the introductory call to make sure it is clear and you know what to expect. Here is the full interview process you can expect. It's our go-to for most positions:
🤙 Introductory call [~1h]
👨‍🏫 Tech call with a team member [~45m]
👩🏾‍💻 Take-home coding task [real-world, asynchronous] - We pay for your time! See this FAQ.
🦾 Task summary / retro call [Optional, ~1h]
✏️ Offer
👩‍💻 Culture - We take care of our people
💖 Diversity - As a distributed company, diversity drives our identity. Whether you're looking to launch a new career or grow an existing one, iterative.ai is the type of company where you can balance great work with great life, and work with a wonderful team that does the same! No matter who you are or where you're from, we need you for what you can do and for caring about ML and delivering great developer tools!
⚖️ Equal opportunities - We strive to have parity of benefits across regions, and while regulations differ from place to place, we believe taking care of our people is the right thing to do.
No country or region takes precedence for personal growth, compensation, team recognition, or anything else; it just doesn't matter where you are.
👣 Flexibility first - Ability to craft your calendar with flexible locations and schedules
⚓️ Team Driven Culture - Engineering team is involved in product discussions and pl