Data Scientist



This job is no longer accepting applications

See open jobs at IBM.
Data Science
Posted on Tuesday, January 9, 2024
As a Data Scientist at IBM, you will help transform our clients’ data into tangible business value by analyzing information, communicating outcomes and collaborating on product development. Work with best-in-class open source and visual tools, along with the most flexible and scalable deployment options. Whether it’s investigating patient trends or weather patterns, you will work to solve real-world problems for the industries transforming how we live.

Your Role and Responsibilities

Octo, an IBM company, is an industry-leading, award-winning provider of technical solutions for the federal government. At Octo, we specialize in providing agile software engineering, user experience design, cloud services, and digital strategy services that address government’s most pressing missions. Octo delivers intelligent solutions and rapid results, yielding lower costs and measurable outcomes.

Our team is what makes Octo great. At Octo you’ll work beside some of the smartest and most accomplished staff you’ll find in your career. Octo offers fantastic benefits and an amazing workplace culture where you will feel valued while you perform mission-critical work for our government. Voted one of the region’s best places to work multiple times, Octo is an employer of choice!


We are looking for a mid-level Data Scientist to support an initiative within the Department of Veterans Affairs (VA). In this role you will work across our client engagements, providing expertise in Azure cloud computing (including AML, ADF, Databricks, and Data Lake), machine learning, NLP, big data processing, deep learning, Python (data wrangling, joins, cleaning, visualization, statistical learning), Azure SQL, Git, and Apache Spark (cluster computing and management). You will be responsible for designing and developing production-ready statistical and machine learning models that leverage healthcare, healthcare operations, and related datasets, and for contributing to and producing technical and data process documentation.


We were founded as a fresh alternative in the Government Consulting Community and are dedicated to the belief that results are a product of analytical thinking and agile design principles, and that solutions are built in collaboration with, not for, our customers. This mantra drives us to succeed and act as true partners in advancing our clients’ missions.

Program Mission…

This program supports Veterans Affairs’ strategic mission of furthering efforts to modernize its data analytics platform and enhance accessibility to enterprise data and reporting tools.


  • Work across client engagements, providing expertise in data collection, data analysis, data mapping, data profiling, data mining and data modeling.
  • Inspect, cleanse, transform, and model data, and address issues related to data completeness and quality.
  • Work directly with our software development team to ensure that we are creating best-in-class solutions to solve our customers’ complex data challenges.
  • Develop predictive and machine learning models from both structured and unstructured data (e.g., identify usage patterns, predict utilization).
  • Create training materials for individuals at different levels of experience to leverage these models and results in their own work.
  • Identify, create, and curate training, test, and validation datasets.
  • Support development of monitoring and re-training procedures for models in production.
  • Act as a subject matter expert throughout the lifecycle of projects to support both business and technical stakeholders in generating solutions and associated requirements.

Years of Experience: 5 years of recent professional experience in data science, data mining, data analysis, business process analysis, and/or healthcare analytics.

Education: Bachelor’s degree in a quantitative field such as Computer Science, Statistics, or Mathematics (or related field) required. M.S. and/or Ph.D. preferred.

Location: Remote within the United States.

Clearance: Ability to obtain a Public Trust security clearance.

Required Technical and Professional Expertise

  • See above for experience and educational requirements.
  • Must have at least 5 years of recent professional experience in data science, data mining, data analysis, business process analysis, and/or healthcare analytics.
  • Prefer at least 3-5 years working with Machine Learning and/or Natural Language Processing (in particular, Named Entity Recognition, Deep Learning, Neural Networks, Image Processing, NLP, Spark, TensorFlow) methodologies.
  • Prefer at least 3-5 years of programming experience: SQL, Python (Pandas, PySpark, PyTorch).
  • Prefer at least 3-5 years of experience using Machine Learning tools, deploying models, and deploying software in Azure (certification preferred).
  • Ability to conduct data profiling and predictive analysis using a variety of standard tools.
  • Experience with data visualization tools and methodologies.
  • Experience with Azure Data Lake Storage, Azure Data Factory, Azure SQL DW, Azure Synapse, Databricks, Spark, and/or Python.
  • Experience with Predictive Modeling: linear and logistic regression, ordinary least squares, decision trees, K-means clustering, K-nearest neighbors, hierarchical clustering.
  • Experience with Databases: SQL (MySQL, SQLite, PostgreSQL, T-SQL), Data Lake.
  • Experience with Data Visualization: Tableau, Power BI.
  • Experience with Version Control: Git, GitHub, Confluence.
  • Data Science Methods: Problem Identification, Data Mining, Wrangling, Analytics, EDA, Feature Engineering, Modeling, Report Writing.
  • Experience with ML applications and the end-to-end process (local and cloud development).
  • Experience with ML Models: classical models, deep learning (text, audio, images).
  • Model pipelines and deployment, AutoML, MLflow, Databricks.
  • Excellent ability to communicate concisely and effectively with engineers and clients.
  • Clearance: Ability to obtain a Public Trust security clearance.

Preferred Technical and Professional Expertise

  • Previous experience working with the Dept. of Veterans Affairs or other government clients such as Dept. of Defense (DoD).
  • Exposure to Microsoft Azure services and cloud-based systems.
  • Prior experience with metadata management to include meta-tagging.
  • Previous experience working in an Agile Team setting and using Agile management tools such as Jira.
  • Ability to uncover data-driven insights using statistical analysis or predictive analytics.
  • Experience with machine learning, natural language processing, and statistical analysis methods to include classification, collaborative filtering, association rules, sentiment analysis, topic modeling, time-series analysis, regression, statistical inference, and/or validation methods.
