
Find your future role
Job title
Senior Data Scientist (Engineering Focus)
Ref no. | BHN585901 |
---|---|
Salary | £110,000 - £145,000/annum |
Location | London, England |
Start date | ASAP |
Job type | Permanent |
Job status | Open |
Job summary
Data Scientist with a strong engineering background with experience with real time data processing, 3-4 days on sight paddington.
Key skills required for this role
Data Science - quantitative finance or trading research - Data Engineer
Important
Senior Data Scientist
Job description
About the Role
We are seeking a highly skilled Senior Data Scientist to design and implement a robust data pipeline for an AI/LLM-driven trading platform. The ideal candidate will be responsible for collecting, extracting, cleaning, normalizing, and structuring large-scale data from sources (e.g., financial articles, books, and research papers). This role requires deep expertise in data engineering, NLP, and financial market data to ensure high-quality datasets for machine learning and trading strategy development.
Key Responsibilities
- Data Pipeline Development: Build and maintain a scalable and efficient data pipeline that processes unstructured data for AI/LLM models.
- Unstructured Data Processing: Extract and preprocess text from books, articles, and research reports to be utilized in AI/LLM-based analytics.
- ETL & Data Engineering: Develop ETL processes to ingest and transform data from multiple sources into a unified format.
- Data Quality Assurance: Implement validation and anomaly detection mechanisms to ensure data integrity and accuracy.
- Collaboration: Work closely with quant researchers, AI/ML engineers, and trading teams to ensure seamless data flow and accessibility.
- Automation & Optimization: Automate data ingestion, transformation, and storage processes for efficiency and scalability.
Requirements
- Strong academic background in Data Science, Machine Learning, Artificial Intelligence, or a related field.
- Experience in building and optimising data pipelines and workflows.
- Proficiency in Python, SQL, and ML libraries (TensorFlow, PyTorch, scikit-learn, etc.) and at least one Object-Oriented Programming (OOP) language (e.g., Rust, C++, C#, …)
- Hands-on experience with LLMs, NLP techniques, and AI-driven data processing.
- Experience working with financial datasets and understanding of market data structures, is preferred.
- Knowledge of APIs, cloud computing (AWS, GCP, Azure), and database management.
- Strong problem-solving skills with the ability to work on complex data-driven projects.
Nice to Have
- Background in quantitative finance or trading research.
- Experience in real-time data processing and streaming architectures.
- Exposure to cryptocurrency markets.