Full Job Description
Guardant Health is a leading precision oncology company focused on helping conquer cancer globally through use of its proprietary blood tests, vast data sets and advanced analytics. Its Guardant Health Oncology Platform is designed to leverage its capabilities in technology, clinical development, regulatory and reimbursement to drive commercial adoption, improve patient clinical outcomes and lower healthcare costs. In pursuit of its goal to manage cancer across all stages of the disease, Guardant Health has launched multiple liquid biopsy-based tests, Guardant360 and GuardantOMNI, for advanced stage cancer patients, which fuel its LUNAR development programs for recurrence and early detection. Since its launch in 2014, Guardant360 has been used by more than 6,000 oncologists, over 50 biopharmaceutical companies and all 27 of the National Comprehensive Cancer Network centers.
The Data Platform team provides an enriched and valuable ecosystem of data sources and data services that drive innovation for internal and external systems. This team is dedicated to developing advanced technology (Big Data, Cloud, Machine Learning), systems and services to make data secure, rich, high quality, and fast, thereby enabling Guardant to leverage its data assets effectively and in a timely manner to maximize technology and business development in the extraordinarily complex oncology diagnostic and therapeutic landscape.
We connect patients with clinical trials, help clinicians order our tests and receive our clinical reports, and deliver valuable genomic datasets to researchers to help uncover important insights into treatment paradigms and drug discovery. Our technology stack reflects our view of using the best tools for the job: we employ Scala, Java, and Python along with Kubernetes, Apache Spark, Presto, Kafka, Docker, MySQL, MongoDB and a variety of AWS services to analyze and disseminate vast volumes of genomic data.
Data Acquisition: Utilize expert coding skills to build real-time distributed and reliable data pipelines that ingest and process data at scale.
Data Architecture: Design and build big data systems and data lakes; translate the needs of the business to productize models and data visualizations into a functional data architecture; partner with Healthcare Intelligence.
Data Validation / Accuracy: Develop quality checks to ensure data accuracy and integrity; recommend process improvements that enhance data integrity; ensure ongoing data integrity and perform skillful data validation.
Reporting / Analysis: Work independently with senior leaders to tackle complex problems by developing sophisticated, testable hypotheses; present findings formally to diverse stakeholders and committees; identify opportunities for improvement that result in meaningful change.
Display / Visualization: Demonstrate proficiency with data visualization tools; develop visualization concepts; deliver excellent visual storytelling; solve complex technical challenges.
Clinical Data Expertise: Serve as a strong analytic resource in clinical subject areas, with a good understanding of the characteristics of data in sources including the EDW and the Data Lake.
10+ years of software development experience
Minimum of 4 years of big data platform or domain experience
Extensive experience with programming languages such as Scala and Java
Strong experience coding with streaming/micro-batch compute frameworks, preferably Spark
Work collaboratively with the business and bioinformatics scientists, and translate business requirements into enterprise information architecture
Drive the architecture of data integration from various clinical applications and stores, research databases and external sources
Develop the processes for updating and maintaining terminologies and vocabularies, including mapping from local to international standards when applicable
Strong knowledge of statistics, data analysis and databases
Strong hands-on skills with Solr: querying and indexing, configuring schemas, understanding advanced schema fields, choosing commit strategies and tuning the relevancy of search results
Flair for data, schemas and data models, and for bringing efficiency to the big data life cycle
Expertise in designing and building data warehouses in big data systems, dimensional data models and strong hands-on SQL knowledge
Understanding of automated QA needs related to big data
Understanding of various visualization platforms (Tableau, D3.js, others)
Proficiency with agile or lean development practices
Strong object-oriented design and analysis skills
Strong aesthetic sensibility that supports clear visual communication of quantitative information
Experience with application performance monitoring and assessment desired
Knowledge of healthcare, including clinical terms and concepts, is a plus
Experience with managing data in a regulated (HIPAA-compliant) healthcare environment is a plus
BS/MS/PhD in a quantitative scientific field (computer science, engineering, mathematics, statistics, bioinformatics, etc.)
Top skill sets / technologies in the ideal candidate:
Programming languages - Java (required), Scala, Python, R
Databases - Oracle, complex SQL queries, performance tuning concepts, AWS RDS, Presto, Redshift; NoSQL - HBase, MongoDB, Cassandra
Batch processing - Hadoop MapReduce, Apache Spark, AWS EMR
Stream processing - Spark Streaming, Apache Storm, Flink
ETL Tools - DataStage, Informatica, NiFi
Code/Build/Deployment - Git, SVN, Maven, SBT, Jenkins, Bamboo
You have strong knowledge and experience addressing a broad range of accounting matters, ensuring they are processed in compliance with established internal controls. You possess the analytical skills needed to correctly grasp and communicate data and to analyze and reconcile accounts, and you can handle confidential and sensitive information with appropriate discretion and manage multiple deadlines.
You are a self-starter who works well as a team player but can work independently when appropriate. You can analyze problems and actively strategize to resolve them, pay attention to detail, and have excellent organization and communication skills. You are results oriented. You can juggle multiple tasks and work cross-functionally and at all levels of the organization, whether internally or externally. You are flexible and comfortable in a dynamic, fast-paced environment and can prioritize to focus on the important, not just the urgent.
The employee may be required to lift routine office supplies and use office equipment. The majority of the work is performed in a desk/office environment; however, there may be exposure to high noise levels, fumes, and biohazard material in the laboratory environment. The role requires the ability to sit for extended periods of time.
Guardant Health is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.
All your information will be kept confidential according to EEO guidelines.
To learn more about the information collected when you apply for a position at Guardant Health, Inc. and how it is used, please review our Privacy Notice for Job Applicants.
Please visit our career page at: http://www.guardanthealth.com/jobs