Senior AI/ML Data Scientist
- Woodlawn, Maryland, United States
- Full-time
- Salary: Not Available
- Posted on:
- Expires on:
JOB TITLE:
Senior AI/ML Data Scientist
JOB Type:
CTC
JOB SKILLS:
Not Provided
JOB Location:
Woodlawn, Maryland, United States
JOB DESCRIPTION
**Job Title : Senior AI/ML Data Scientist** **Location : Woodlawn, MD (Hybrid)** **Duration : Long Term** **Note: Selected candidate must reside within two (2) hours of SSA Headquarters in Woodlawn, MD. Selected candidate must be willing to work on-site at least 3 days a week.** **Position Description:** - Staying updated on the new methods in NLP, ML and Generative AI. - Understanding real world challenges and developing automated data solutions - Developing, testing, and deploying new techniques for NLP understanding - Scalable development/deployment of ML and Generative AI approaches (such as Large Language Models (LLMs) - Training and optimizing NLP/LLM models and creating Python based pipeline. - Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution. - Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem. - Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems. Provide accurate, timely, complex, and sophisticated data analysis.
**Skills Requirements:** **Basic Qualifications:** - Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science with industry experience on NLP, data science, AI/ML/LLM engineering. - Minimum 8 Year (s) of Data Scientist experience - Must be able to obtain and maintain a Public Trust. **Required Skills:** - Experience with Natural Language Processing (NLP), Generative AI and Large Language Models (LLM) - Fluency in Python Programming, version control and collaboration with GIT, standard python packages (ex. Pandas, NumPy, matplotlib) and ML frameworks - Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), Amazon Web Services EC2. - Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search. - Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models. - Experience with ML model deployment and operations like Devops, MLOps, LLMOps. - Experience with NLP and Generative AI libraries like regular expressions (like spacy, LangChain), text annotation tools and semantic frameworks. - Experience with statistical and machine learning software such as pandas and scikit-learn. - Prior experience working on applications that relates to clinical domain. - Ability to clean and process large amounts of real-world data. - Experience retrieving and manipulating data from a variety of data sources included Db2, Oracle, SQL Server, Hadoop, and flat files. - Experience with database management systems, e.g., MySQL, SQLite, SQL, etc. - Either experience with, or the ability and willingness to learn distributed processing via the Hadoop ecosystem, i.e., Spark, Impala and Hive. - Excellent analytical skills to identify potential risks and propose effective solutions. - Clear communication skills to convey complex technical concepts to various partners. - Ability to collaborate with cross-functional teams. - Providing problem solving skills, proven communication in written and verbal formats to various audiences to include executive leadership.
Position Details
Posted:
Employment:
CTC
INDUSTRY:
-
Salary:
Not Disclosed
REFERENCE NUMBER:
OOJ - 11667
CITY:
Woodlawn
JOB ORIGIN:
oorwin