May 29
🔄 Hybrid – Manhattan
• Architecting, developing, and improving automated data pipelines
• Optimizing data pipelines for maximum speed, scalability, and accuracy
• Collaborating with data scientists and other engineers to define data requirements and structures
• Writing unit tests and conducting system testing to ensure the reliability and integrity of data
• Staying up to date with emerging trends and technologies in data engineering, data processing, distributed computing, and data quality assurance
• 3+ years in data engineering or a similar role, with experience building data pipelines from the ground up
• Bachelor's degree in computer science, software engineering, or a related field
• Strong working knowledge of Spark, PySpark, or other data processing frameworks
• Experience with SQL and NoSQL database technologies
• Experience automating data processing with Python tools like Airflow
• Strong problem-solving and analytical skills with keen attention to detail
• Experience with AWS or other cloud platforms
• Ability to manage multiple projects simultaneously and meet tight deadlines
• 100% health insurance coverage, with the option to join group dental and vision for a nominal fee
• Team lunches every Thursday in the NYC office
• 4% 401(k) matching
• 20 paid time off days
• 5 paid sick days
• 12 weeks of paid parental leave
• 10+ annual company holidays