Role Overview
We are searching for a talented data engineer to spearhead efforts in strengthening our data framework. This position entails constructing and refining systems to handle a variety of data, such as business, functional, genetic, and patient-related information, while integrating advanced artificial intelligence solutions to create organizational value.
Core Duties
- Data Infrastructure Development: Create and sustain high-performance data systems to enable reporting, analysis, machine learning, and biological data applications.
- Data Framework Design: Provide secure, uniform, and centralized data access throughout the organization.
- Data Processing: Develop efficient pipelines to manage and process substantial datasets from internal and external sources.
- AI and Machine Learning Enablement: Support computational teams in utilizing clinical and genetic data for AI and ML initiatives.
- Team Collaboration: Partner with diverse technical and business teams to understand requirements, align on objectives, and implement solutions.
- Data Standards: Uphold data governance practices to meet regulatory and compliance requirements.
- Process Optimization: Identify and pursue opportunities to improve workflows and systems, emphasizing quality and effectiveness.
What Makes This Role Exciting?
This position places you at the forefront of healthcare innovation, addressing intricate challenges with real-world implications for patient care. You'll work with cutting-edge tools and witness the direct outcomes of your contributions.
Candidate Profile
We seek a data engineer with exceptional problem-solving and systems-oriented thinking. The ideal individual excels at converting business needs into technical implementations and has a demonstrated history of crafting reliable data ecosystems.
Education
- Bachelor's degree in Computer Science, Engineering, or a comparable technical discipline (e.g., Physics, Math).
Experience
- Over 10 years in data engineering or software development, including a minimum of 5 years working on large-scale data systems.
Technical Competencies
- Data Platforms & Cloud: Skilled in top-tier data platforms (e.g., Databricks, Snowflake) and cloud environments (e.g., AWS, GCP, Azure).
- Data Pipelines: Proficient with ETL/ELT tools (e.g., Fivetran, Airbyte) and large-scale processing frameworks (e.g., Spark, Kafka).
- Coding: Advanced proficiency in Python and SQL, adhering to software development best practices (e.g., version control, CI/CD).
- Data Structuring: Thorough knowledge of data architecture, spanning databases, warehouses, and lakes.
- Analytics Tools: Experience with visualization platforms like Looker, Tableau, or Power BI.
- ML/AI Expertise: Familiarity with data science libraries (e.g., pandas, numpy) and ML tools (e.g., MLflow).
- Generative AI: Understanding of generative AI frameworks and services (e.g., LangChain, Amazon Bedrock).
- Compliance: Background in maintaining data integrity and adhering to regulations, especially with sensitive data.
Interpersonal Skills
- Thrives in a fast-moving, ever-changing workplace.
- Balances efficiency with high-quality outputs.
- Excellent planning and communication abilities.
- Dedicated to delivering top-notch results.
Bonus Qualifications
Preference will be given to candidates with experience in healthcare or life sciences, knowledge of genomics or bioinformatics, or a history of operationalizing AI solutions.