Senior AI/ML Data Scientist

  • Woodlawn, Maryland, United States
  • Full-time
  • Salary: Not Available

JOB TYPE:

CTC

JOB DESCRIPTION

**Job Title: Senior AI/ML Data Scientist**

**Location: Woodlawn, MD (Hybrid)**

**Duration: Long Term**

**Note: Selected candidate must reside within two (2) hours of SSA Headquarters in Woodlawn, MD. Selected candidate must be willing to work on-site at least 3 days a week.**

**Position Description:**

- Staying current with new methods in NLP, ML, and Generative AI.
- Understanding real-world challenges and developing automated data solutions.
- Developing, testing, and deploying new techniques for natural language understanding.
- Developing and deploying ML and Generative AI approaches, such as Large Language Models (LLMs), at scale.
- Training and optimizing NLP/LLM models and creating Python-based pipelines.
- Determining the nature of analytic problems, evaluating options, and offering recommendations for resolution.
- Advising on the methods and data needed and/or available to evaluate the (intelligence or data) problem.
- Collaborating with data collectors and analysts to identify and close gaps on complex monitoring problems. Providing accurate, timely, complex, and sophisticated data analysis.

**Skills Requirements:**

**Basic Qualifications:**

- Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science, with industry experience in NLP, data science, and AI/ML/LLM engineering.
- Minimum of 8 years of experience as a Data Scientist.
- Must be able to obtain and maintain a Public Trust.

**Required Skills:**

- Experience with Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs).
- Fluency in Python programming, version control and collaboration with Git, standard Python packages (e.g., pandas, NumPy, Matplotlib), and ML frameworks.
- Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), and Amazon Web Services EC2.
- Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search.
- Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models.
- Experience with ML model deployment and operations practices such as DevOps, MLOps, and LLMOps.
- Experience with NLP and Generative AI tooling, such as regular expressions, libraries like spaCy and LangChain, text annotation tools, and semantic frameworks.
- Experience with statistical and machine learning software such as pandas and scikit-learn.
- Prior experience working on applications that relate to the clinical domain.
- Ability to clean and process large amounts of real-world data.
- Experience retrieving and manipulating data from a variety of data sources, including Db2, Oracle, SQL Server, Hadoop, and flat files.
- Experience with SQL and database management systems, e.g., MySQL and SQLite.
- Experience with, or the ability and willingness to learn, distributed processing via the Hadoop ecosystem, i.e., Spark, Impala, and Hive.
- Excellent analytical skills to identify potential risks and propose effective solutions.
- Clear communication skills to convey complex technical concepts to various partners.
- Ability to collaborate with cross-functional teams.
- Strong problem-solving skills and proven written and verbal communication with a range of audiences, including executive leadership.

Position Details

REFERENCE NUMBER:

OOJ - 11667

JOB ORIGIN:

oorwin