Lead Data Scientist


Job Details

Job Title: Lead Data Scientist

Locations: Remote

Clearance: Eligible to obtain a Secret Clearance for upcoming work

Job Summary

IntelliBridge is seeking a Lead Data Scientist to lead an applied data science research team in developing novel methods for extracting insights from, and analyzing, massive disparate datasets. As a lead data scientist, you will identify and source data to be consumed by our data lakehouse ELT processes and contribute to smart, AI-enabled data pipelines. You will be responsible for using AI to mine, enrich, fuse, and visualize data to provide insight to our clients. In this role, you will collaborate with both technical and non-technical members of the data and development teams to define requirements and consistently deploy high-quality data products. Our primary goal is to deliver value to our stakeholders in an agile manner. We seek an individual with an insatiable curiosity about data, a genuine passion for understanding datasets comprehensively, and the keen attention to detail needed to interpret data accurately.

Job Responsibilities

- Guide and support the implementation of new data science solutions
- Design, architect, and support key datasets that provide structured and timely access to actionable business insights or decision making
- Implement, test, deploy, and maintain stable, secure, and scalable data mining, enrichment, fusion, and predictive AI/ML solutions
- Fine-tune and deploy Generative LLM AI models for specific tasks
- Develop Retrieval Augmented Generation (RAG) services using open-source tools and models to provide AI assistant services that have access to internal data sources
- Lead cross-functional teams to develop data-intensive software products
- Support all data staff in troubleshooting code issues, performing code reviews, and devising testing strategies
- Monitor existing metrics, analyze data, and lead partnership with other Data and Analytics personnel to identify and implement system and process improvements
- Lead a team to develop processes that ingest multiple data sources, enrich data with AI/ML processes, and provide that data to other data consumers
- Lead a team to maintain the infrastructure to support extraction, loading, and transformation (ELT) of data from a wide variety of data sources
- Expose AI/ML-enriched data via APIs, dashboards, and user applications
- Utilize DevOps Continuous Delivery best practices
- Configure and manage data analytic frameworks and pipelines using databases and tools
- Lead a team to design and manage custom data dashboards using Kibana and Power BI to display data insights
- Administer cloud computing and CI/CD pipelines, including Amazon Web Services (AWS)
- Contribute to MLOps processes, including the deployment and integration of Generative AI LLMs

Position Requirements

- Minimum of ten (10) years of Software or Data Science experience or equivalent
- Bachelor's Degree in Computer Science, Information Technology, or a STEM field
- Strong understanding of data operations and data systems
- Proficient in Agile Development
- Ability to form strong cross-functional relationships and lead a project team
- Demonstrated expertise in technical data science and engineering on complex applications, systems, software, and projects
- Senior-level experience in analysis, design, development, testing, and implementation of applications
- Expertise in developing and maintaining data pipelines and databases for insights, analytics, and visualizations
- Deep knowledge of machine learning and artificial intelligence for building enrichment data pipelines and AI backend services
- Excellent verbal and written communication skills
- General knowledge of Generative Pre-Trained AI Large Language Models (GPT AI LLMs), as well as traditional machine learning approaches
- Ability to develop novel data mining processes using GenAI and LLMs
- Ability to lead teams through ML development, training, deployment, monitoring, and support lifecycles

Desired Skills And Abilities (Salary Commensurate)

Knowledgeable and experienced in:
- Data analysis and statistics
- Machine Learning
- Python
- Git and Git Operations
- SQL
- AWS
- Docker
- Transformers and Natural Language Processing model fine-tuning

Additional desired experience:
- Experience with machine learning processes, solutions, and applications
- Experience with data science algorithms such as boosted decision trees, logistic regression, and autoregressive integrated moving averages
- Experience with all parts of the AI/ML lifecycle, including training, deployment, and monitoring
- Experience in network analytics, knowledge graphs, and graph databases
- Experience building data pipelines and working with data lakes or lakehouses using Databricks
- Advanced Degree (Master's or PhD)

Additional Preferred Skills And Abilities

- Experience in multiple programming languages
- Experience with Python libraries including transformers, FastAPI, pydantic
- Experience with AWS services: S3, EC2, Athena, Glue, ECR, ECS
- Experience with Retrieval Augmented Generation (RAG) LLM systems
- Experience working with and fine-tuning LLM AI models
- Experience working with CI/CD workflows
- Experience with data augmentation techniques

About Us

IntelliBridge delivers IT strategy, cloud, cybersecurity, application, data and analytics, enterprise IT, intelligence analysis, and mission operation support services to accelerate technical performance and efficiency for Defense, Civilian, and National Security & Federal Law Enforcement clients.





IntelliBridge

05/03/2024

McLean, VA