Axle Informatics

Data Scientist - Data Quality

Job Locations US-MD-Rockville
Posted Date 7 months ago(10/26/2023 11:06 AM)
# of Openings


Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).

Axle is seeking a Data Scientist - Data Quality to join our vibrant team at the National Institutes of Health (NIH) supporting the National Center for Advancing Translational Sciences (NCATS) located in Rockville, MD. 


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)


We are looking for a skilled and motivated Data Quality expert to lead the development of Real World Data (RWD) assets to support projects at the NIH.  The position will be based at Axle in Bethesda, Maryland, but functionally remote from US locations, with occasional travel annually. 


As a Data Scientist with concentration in Data Quality for the All of Us Research Program, you will be responsible for designing, developing, and implementing systems to ensure the accuracy, completeness, reliability and study-readiness of the Center for Linkage and Acquisition of Data (CLAD) platform. You will collaborate with cross-functional teams, including software developers, data engineers, data scientists, epidemiologists, and project managers to ensure the successful delivery of a scalable, secure, and efficient research platform.  Your role is key in providing up-front measures of database contents, missingness, variable quality and several other dimensions.  Creativity in your approach to digesting massive amounts of information and how you communicate those findings back to the CLAD community will be essential.



  • Produce criteria to evaluate improvements in the quality and scope of analyses of PPRL-linked Real World Data.

  • Design, implement, and maintain systems for checking and reporting on the quality of complex research databases, including assessment of missingness, validation of content and readiness for the application of statistical methods.

  • Perform end to end QA processes for research databases.

  • Maintain database dictionaries, schemas, diagrams and documentation. 

  • Work with internal and external collaborators to examine, transfer, and index data towards development of automated data processing pipelines. 

  • Understand the business issues and data challenges of enterprise Real World databases used for observational research.

  • Participate in all aspects of business analysis, testing, including functional, regression, integration, load and system testing.

  • Review and edit requirements, specifications, business processes and recommendations related to proposed solutions.

  • Take the initiative to suggest new standards, implement new strategies, and take on special projects.

  • Investigate and resolve operational problems in conjunction with other engineering and technical personnel.

  • Provide technical support and advice to Leads, PIs, technical staff and other engineering groups.

  • Keep aware of developments and trends in best practices for data quality analysis. 



  • MS in computer science, (bio)statistics, informatics or similar quantitative degree preferred.

  • Minimum Bachelor's degree in statistics, mathematics, computer science, information management, or similar.

  • At least 5 years of experience in data analysis.

  • Proficiency in programming and scripting languages, including Python, R and SQL.

  • Working knowledge of statistical methods and tests.

  • Familiar with large claims and EHR databases

  • Knowledge of PPRL databases

  • Proven track record of working with real world clinical data.

  • Familiarity with medical coding systems (ICD, CPT, HCPCS, NDC, LOINC, SNOMED, etc.)

  • Exceptional analytical skills.

  • Advanced problem-solving skills.

  • Knowledge of best practices in data analysis.

  • Excellent interpersonal and communication skills.

Disclaimer:The above description is meant to illustrate the general nature of work and level of effort being performed by individual’s assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact:


Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed