Senior AI/ML Data Scientist

Woodlawn, Maryland, United States

Full-time
Salary: Not Available

Posted on: Oct 21, 2024
Expires on: Nov 21, 2024

JOB TITLE:

Senior AI/ML Data Scientist

JOB Type:

CTC

JOB SKILLS:

Not Provided

JOB Location:

Woodlawn, Maryland, United States

JOB DESCRIPTION

**Job Title : Senior AI/ML Data Scientist**

**Location : Woodlawn, MD (Hybrid)**

**Duration : Long Term**

**Note: Selected candidate must reside within two (2) hours of SSA Headquarters in Woodlawn, MD. Selected candidate must be willing to work on-site at least 3 days a week.**

**Position Description:**

- Staying updated on the new methods in NLP, ML and Generative AI.
- Understanding real world challenges and developing automated data solutions
- Developing, testing, and deploying new techniques for NLP understanding
- Scalable development/deployment of ML and Generative AI approaches (such as Large Language Models (LLMs)
- Training and optimizing NLP/LLM models and creating Python based pipeline.
- Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution.
- Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem.
- Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems. Provide accurate, timely, complex, and sophisticated data analysis.
**Skills Requirements:**

**Basic Qualifications:**

- Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science with industry experience on NLP, data science, AI/ML/LLM engineering.
- Minimum 8 Year (s) of Data Scientist experience
- Must be able to obtain and maintain a Public Trust.

**Required Skills:**

- Experience with Natural Language Processing (NLP), Generative AI and Large Language Models (LLM)
- Fluency in Python Programming, version control and collaboration with GIT, standard python packages (ex. Pandas, NumPy, matplotlib) and ML frameworks
- Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), Amazon Web Services EC2.
- Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search.
- Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models.
- Experience with ML model deployment and operations like Devops, MLOps, LLMOps.
- Experience with NLP and Generative AI libraries like regular expressions (like spacy, LangChain), text annotation tools and semantic frameworks.
- Experience with statistical and machine learning software such as pandas and scikit-learn.
- Prior experience working on applications that relates to clinical domain.
- Ability to clean and process large amounts of real-world data.
- Experience retrieving and manipulating data from a variety of data sources included Db2, Oracle, SQL Server, Hadoop, and flat files.
- Experience with database management systems, e.g., MySQL, SQLite, SQL, etc.
- Either experience with, or the ability and willingness to learn distributed processing via the Hadoop ecosystem, i.e., Spark, Impala and Hive.
- Excellent analytical skills to identify potential risks and propose effective solutions.
- Clear communication skills to convey complex technical concepts to various partners.
- Ability to collaborate with cross-functional teams.
- Providing problem solving skills, proven communication in written and verbal formats to various audiences to include executive leadership.

Position Details

Posted:

Oct 21 2024

Employment:

CTC

INDUSTRY:

Salary:

Not Disclosed

REFERENCE NUMBER:

OOJ - 11667

CITY:

Woodlawn

JOB ORIGIN:

oorwin