*SEEKING WORK | Remote | UK (Open to time zone overlap)*
I’m Joshua Greenhalgh, a senior data engineer and software engineer with over 6 years of experience building scalable, reliable data pipelines and frameworks, backed by a strong academic background in computational modeling and applied mathematics.
### *Professional Highlights*

- *Machine Learning Pipelines*: Built scalable ML frameworks integrating AWS SageMaker and Dagster for cell-level classification models in flow cytometry, reducing training times and enhancing reproducibility.
- *Data Engineering*: Migrated ad-hoc pipelines to Prefect orchestrated systems and rebuilt data warehouses for analytics teams, improving data accessibility and reliability.
- *Cloud Expertise*: Extensive experience with GCP, AWS, Terraform, and Kubernetes for building cost-efficient and scalable infrastructure.
- *Project Leadership*: Designed solutions across domains, from humanitarian mobility analysis with Flowminder to ad-serving analytics at MOBKOI.
- *Teaching & Mentoring*: Conducted training on Python, R, and HPC programming, empowering teams to deploy their own pipelines.

### *Academic Highlights*

- *Ph.D. Research* (University of Southampton, incomplete): Investigated Bayesian and deep learning approaches to non-linear inverse problems in X-ray tomography, focusing on artifact reduction.
- *M.Sc. in Computational Modeling*: Specialized in molecular dynamics, Monte Carlo simulation, and advanced numerical methods.
- *Teaching Roles*: Taught undergraduate and postgraduate courses, including parallel programming on HPC infrastructure and advanced Python programming for research.
- *Dissertations*: Applied deep learning to signal deconvolution and developed computational approaches to problems in group theory.

### *Technologies*

Python, Prefect, Airflow, Kafka, GCP, AWS, Terraform, Kubernetes, Docker, SQL, Postgres, Snowplow, Tableau, React.

I’m passionate about combining academic rigor with industry expertise to solve complex data and infrastructure challenges. If you’re looking for someone to scale data pipelines, optimize ML workflows, or enhance your cloud infrastructure, I’d love to help.
*Contact me at*: joshuadouglasgreenhalg at gmail dot com

*GitHub*: [josh-gree](https://github.com/josh-gree)