Description
As a Principal Data Scientist, you will build and model digital product and platforms that bring Amgen’s AI/ML and GenAI solutions to life.
This role has to collaborate with Amgen’s Technical Architect, Product Manager, UX designers, and Back-end engineers to design secure, scalable, and user-centric products that accelerate discovery, manufacturing, and commercial analytics, corporate functions products
**Key Responsibilities**
+ Strong experience with statistics and machine learning, including deep learning, natural language processing (NLP) and, experience in building cloud-scale systems and working with open-source stacks for data
+ Experiment with large language models (LLM), Artificial intelligence (AI) for code or related fields, Generative AI, Foundational Models, Supervised and Unsupervised Learning
+ Collaborate with cross-functional teams to understand the requirement and design solutions that meet business needs. Also with Data Architects, Business SMEs, and Technical Architect to design and develop to meet fast-paced business needs.
+ Explore new tools and technologies that will help rapid development of solutions
+ Participate in sprint planning meetings and provide estimations on technical implementation
+ Embed responsible-AI and security-by-design controls
**Required Qualifications**
+ 6 –12 years of experience in Data Science (eg: managing structured and unstructured data, applying statistical techniques and reporting results)
+ Doctorate in Computer Science, Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science or a related field
+ Leverage cloud platforms (AWS preferred) to build scalable and efficient solutions
+ Strong background in Deep Learning, Machine Learning, NLP, Data Mining.
+ Excellent communication and stakeholder management skills.
**Preferred Skills**
+ Experience in Generative AI, Foundational Models, LLM’s, Feature Engineering, Selection & Extraction, BI & Automation, Predictive Modelling, Data Visualization, CNN, RNN, GNN, Transformers, Exploratory Data Analysis
+ Familiarity with Python packages (Pytorch, TensorFlow, Hugging FaceScikit-learn, Pandas, NumPy, Matplotlib, Cloud Vision API, RAG, TensorBoard, OpenCV, NLTK)), programming languages ( C, C++, Java, CUDA,SQL, NoSQL, PHP, HTML, JS, CSS etc.)
+ Prior exposure to pharma / life sciences AI environments is preferred.
+ Strong understanding of Responsible AI and model validation principles.





