Backend Engineer
Build and optimize data systems for Paperpile's academic application
As a Backend Engineer at Paperpile, you will develop and maintain the systems that handle large-scale academic data. You will build data pipelines, optimize search, and manage PDFs at scale. The work uses Node.js and TypeScript, with AWS for deploying and operating services.
Why It's Interesting
Join a team that manages a literature database of more than 250 million academic papers
Key Responsibilities
- Build and maintain data pipelines that process heterogeneous data sources
- Tune the full-text search system, including indexing strategy and query optimization
- Develop reliable REST APIs for accessing academic data
- Manage and process PDFs at scale while maintaining high data quality
- Use web scraping and third-party APIs to collect data
Requirements
- Backend engineering experience with data-heavy systems
- Familiarity with Node.js and TypeScript
- Experience deploying and operating services on AWS
- Understanding of full-text search systems and REST APIs
Original description from RemoteOK
Paperpile runs on data at scale, with a literature database of 250M+ academic papers and a growing body of user data accumulated over more than a decade. You'll work across the systems that ingest, process, store, and serve this data reliably: building pipelines, optimizing search, handling PDFs at scale, and exposing clean APIs.
Requirements
- Strong backend engineering background with experience building and operating data-heavy systems in production.
- Experience deploying and operating services on AWS.
- Experience designing and maintaining data ingestion pipelines handling messy, heterogeneous sources.
- Comfortable with web scraping and working with third-party data sources and APIs.
- Familiarity with Node.js and TypeScript. It's fine if you come from a different background, such as Java or Python, but you should be comfortable working in this environment.
- High standards for data quality. You think carefully about correctness, deduplication, and consistency.
- Solid understanding of full-text search systems including indexing strategy, relevance tuning, and query optimization.
- Proficient in building reliable REST APIs.
More useful experience
- Familiarity with academic publishing formats and data sources (PubMed, Crossref, arXiv…)
- Experience with PDF processing pipelines (extraction, transformation, storage and delivery at scale).
- Experience with LLM-based document processing or ML pipelines for extracting structured data from unstructured text.
- Large scale web crawling and scraping.
Compensation
- Base compensation €60,000–€90,000 based on the level of your experience
- Bonus/equity program.
Please mention the word **NOURISH** and tag RMTQxLjI1My4xMDkuOTc= when applying to show you read the job post completely (#RMTQxLjI1My4xMDkuOTc=). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.