Data Scientist (Mammalian Systems)

The Data Scientist is a key role within Intrexon’s Data Science & Computational Biology Unit. This position is responsible for modeling complex problems, discovering insights, and identifying opportunities through the use of statistical, algorithmic, mining and visualization techniques. In addition to advanced analytic skills, the candidate will also be proficient at integrating and preparing varied datasets, helping architect specialized databases, and computing environments, and communicating results. Working closely with scientific and IT partners throughout the organization, the Data Scientist will turn data into critical information and knowledge that can be used to make sound organizational decisions. The Data Scientist must be a creative and influential thinker that can propose innovative ways to look at problems across the analytics maturity spectrum from descriptive to diagnostic, predictive, and prescriptive. The successful candidate will create and use statistical tools to enable data analysis and pattern recognition, supporting and developing Design/Build/Test/Learn protocols associated with complex mammalian biology experiments required for human drug development programs. The role will require a combination of business focus, strong analytical and problem solving skills, deep understanding of mammalian molecular and cellular biology and workable understanding of developmental and physiological functions; and programming knowledge to be able to quickly cycle hypotheses through the discovery phase of the project. Familiarity with molecular biology and microbial physiology concepts is desired.


Job Snapshot:
Posted On:
Location:Germantown, MD
Department:Bio-Informatics Division
Job Type:Full Time
Education:PhD (1+ years’ experience), MS (3+ years’ experience) or BS (5+ years’ experience) in physics, mathematics, statistics, genetics, engineering, bioinformatics, computer science or a related field
Experience:Experience in establishing a bioinformatics pipeline for target and/or drug discovery, statistical data analysis, and design of experiment approaches
Reference Id:1426
Travel Required:No
Manage Others:No
 
Description:
DUTIES AND RESPONSIBILITIES:
  • As a member of cross-functional project teams, work with partners to identify and exploit analytical opportunities, including experimental planning, design and analysis.
  • Apply rigorous statistical analysis, modeling, simulation, and predictive analytics to myriad experimental data sets and raise awareness of the value of various methodologies through education.
  • Propose new experiments and analytical processes to address novel questions that leverage Intrexon’s technology platform for developing gene therapies, genetically modified cell therapies, tissues and mini-organs for regenerative medicine, and cell-based biomaterial manufacturing hosts for production of DNA, RNA, Protein, subcellular organelles, virus-like particles, viruses, and minicells.
  • Be a key partner in supporting Intrexon’s genome engineering activities required for the development and characterization of cells employed.
  • Present findings to research partners by exposing their assumptions and validation work in a way that can be easily understood and leveraged by non-data scientists.
  • Gain insight into various therapeutics and diseases through the analyses of Next Generation Sequencing and various ‘omics datasets.
  • Ability to develop data-driven hypotheses, generate insight and propose novel experimental ideas.
  • Prepare technical reports and make presentations to project teams, leadership, and other stakeholders. 
EDUCATION AND EXPERIENCE:
  • PhD (1+ years’ experience), MS (3+ years’ experience) or BS (5+ years’ experience) in physics, mathematics, statistics, genetics, engineering, bioinformatics, computer science or a related field.
  • Experience in establishing a bioinformatics pipeline for target and/or drug discovery, statistical data analysis, and design of experiment approaches.
  • Proven track record of accomplishments using applied machine learning and/or statistical techniques, preferably in the bio-industrial, life sciences, biotechnology, and pharmaceutical fields.
  • Understanding of eukaryotic and prokaryotic biological systems including molecular and cellular biology.
TECHNICAL SKILLS:
  • Hands on experience in developing and implementing methods in descriptive, predictive, and prescriptive analytics & computational statistics.
  • Experience with supervised and unsupervised machine learning theory and practice.
  • High dimensional data analysis (p >> n, dimensional reduction, clustering, etc.).
  • Frequentist and Bayesian statistical analyses (ANOVA, hierarchical and mixed modeling, FDR procedures, etc.)
  • Statistical computing in R or Python.
  • Familiarity with database query performance optimization.
  • Software development skills (functional vs object oriented design, version control, modular design).
  • High Performance Cloud Computing and Big Data computing architectures.
  • Familiarity and applied experience in cutting edge techniques for machine learning.
DESIRED KEY COMPETENCIES:
  • Ability to understand and execute on the company’s mission and values.
  • Maintain a high degree of ethical standard and trustworthiness.
  • Deals with conflict in a direct, positive manner.
  • Ability to think and adapt to a rapidly changing environment.
  • Able to reach rational conclusions through complex processing of information.
  • Fosters innovation through creative solutions and constructive dialogue and feelings toward the company, coworkers, and tasks being managed.
  • Successful at communicating in both oral and written forms.
EOE MFDV