*VIRTUAL* – Machine Learning and Data Science for Upstream Professionals

Request info

Online Upstream Training


The course aims to provide upstream professionals with a comprehensive introduction to the main machine learning methods and builds hands-on experience in data science and machine learning.
Through the course, you will develop a solid understanding of supervised and unsupervised learning algorithms including advanced topics such as deep learning and machine learning models explainability.
The course is designed to build up your confidence from scratch: starting with an introduction of each method in simple terms, followed by detailed guidelines on how to apply different machine learning methods for solving actual problems from reservoir engineering, geo-modelling, and petrophysics. The knowledge obtained from the course – in combination with carefully designed code examples – can be applied by the participants in ongoing and future projects, thus increasing their overall performance.

Course Structure:

10 modules of max. 4 hours each, delivered over 5 weeks (2 days per week)
Detailed Course Schedule:
  • Week 1: April 29-30, 2021 (Thursday – Friday)
  • Week 2: May 6-7, 2021 (Thursday – Friday)
  • Week 3: May 10-11, 2021 (Monday – Tuesday)
  • Week 4: May 20-21, 2021(Thursday – Friday)
  • Week 5: May 27-28, 2021 (Thursday – Friday)
  • Course Hours: 09:00 am – 13:00 pm CET

Course Level: Skilled


  • A reservoir engineer, geologist or petrophysicist, and keen to obtain a fundamental understanding and practical knowledge on scientific programming, data science and machine learning

Participants should have upstream domain knowledge. Prior programming experience is a plus, but not required.


  • The main machine learning methods will be discussed and illustrated with multiple re-usable code examples and real data sets
  • Solutions of multiple problems related to reservoir engineering, geology and petrophysics will be demonstrated using state-of-the-art machine learning libraries


By the end of the course you will feel confident in your understanding of:

  • Core concepts of machine learning and data science
  • Identifying existing bottlenecks for machine learning methods application in your professional domain
  • Choosing the most appropriate machine learning methods to solve a particular problem
  • Applying the main machine learning methods in practice


Week 1:

  • Introduction to Machine Learning ecosystem
  • Python crash course
  • Data wrangling (using Pandas and SQL)
  • Data visualisation


  • Production data analysis and visualisation
  • Data preparation for material balance calculations
  • Reservoir simulation model QC
  • Well log data visualisation

You will learn how to

  • Confidently use Python programming language and the main machine learning libraries to solve different problems from upstream domain
  • Create a powerful and reusable workflow for production data analysis from different sources (local files and production databases) that can be applied for small and large oil and gas fields
  • Quickly prepare production and pressure data for material balance calculation for the reservoirs of high-level of complexity (multiple compartments and pressure datums) in the format of industry-standard software (PETEX MBAL)
  • Analyse a large number of reservoir simulation runs in an efficient way, quickly getting insights into history matching quality and forecasting results
  • Easily create high-quality visualisation of different kinds of field and well data (production, pressure, well log) to simplify the data analysis and get ready-to-use plots for presentations and reports

Week 2:

  • Numerical optimisation
  • Statistics refresher
  • Exploratory data analysis
  • Uncertainty evaluation and decision making


  • Decline curve analysis
  • PVT data preparation for reservoir simulation
  • Volume-in-place probabilistic estimation
  • Static model upscaling
  • Waterflood optimisation

You will learn how to

  • Apply different numerical optimisation methods to solve practical problems from reservoir engineering domain (fitting rate-time data to understand the reservoir depletion mechanism, matching the reservoir pressure gradient with PVT data for consistent reservoir simulation model initialisation)
  • Perform smart upscaling of the fine grid static model into the coarse grid reservoir simulation model with precise control of the upscaling process and finding a trade-off between model dimensionality reduction and the level of geological details preservation
  • Perform the probabilistic volume-in-place estimation taking into account the uncertainty of input parameters to quickly evaluate volumetrics without building a full-scale geological model
  • Allocate water and gas injection volume between injection wells to maximise oil production using the optimal number of reservoir simulation runs

Week 3:

  • Machine learning introduction
  • Dimensionality reduction methods
  • Clustering methods
  • Anomaly detection methods


  • Electrofacies identification based on well log data
  • Static model realisations screening
  • Numerical well testing

You will learn how to

  • Confidently apply machine learning terminology and identify technical and business requirements for successful application of machine learning methods
  • Choose the most suitable machine learning method to solve a particular problem from the upstream domain depending on the type of the problem, data availability, data quality and solution requirements
  • Perform screening of static model scenarios to simplify the history matching process, reduce the number of simulation runs and efficiently evaluate the impact of geological uncertainty on production forecast
  • Identify the optimal number of electrofacies for a modelling study to guide the distribution of properties in the reservoir model
  • Prepare the pressure data for pressure transient analysis (PTA) by automatically removing error pressure measurements to reduce the amount of manual efforts and build a fully automatic workflow for PTA

Week 4:

  • Machine learning core concepts
  • Regression methods
  • Tuning of machine learning models


  • Production forecast of unconventional reservoir
  • Saturation pressure prediction

You will learn how to

  • Design and perform machine learning study to ensure the solution quality and reproducibility of the modelling results
  • Apply on practice and understand the main concepts of machine learning modelling: train/test split, cross-validation, objective function definition, bias-variance trade-off, hyperparameters tuning
  • Predict the performance of a new well and optimise the well completion design for unconventional reservoirs without building a sound physics-based reservoir simulation model
  • Develop a powerful data-driven model incorporating available fluid studies and predict the saturation pressure with high accuracy for the reservoirs with missing key PVT experiments
  • Automatically find the combination of machine learning model parameters to simplify the model tuning and reduce the amount of manual efforts

Week 5:

  • Classification methods
  • Neural networks and Deep learning
  • Advanced machine learning topics:
    – Imbalanced datasets
    – Interpretability of machine learning models


  • Lithofacies identification
  • Screening of enhanced oil recovery (EOR) methods

You will learn how to

  • Explain machine learning modelling results to technical and business audience to perform QA/QC solution and support decision making
  • Develop a robust classification model for lithofacies identification based on well logs for wells without core data
  • Create enhanced oil recovery screening model that allows incorporating different sources of information (PVT, SCAL, geological data), performing screening of a company’s fields portfolio in an efficient way and identifying the most suitable EOR method for a particular field

Cost: 3450 Euros + Vat

SELECT wp_posts.*, wp_p2p.* FROM wp_posts INNER JOIN wp_postmeta ON ( wp_posts.ID = wp_postmeta.post_id ) INNER JOIN wp_p2p WHERE 1=1 AND ( ( wp_postmeta.meta_key = 'start_date' AND CAST(wp_postmeta.meta_value AS DATE) >= '2024-07-15' ) ) AND ((wp_posts.post_type = 'schedule' AND (wp_posts.post_status = 'publish' OR wp_posts.post_status = 'acf-disabled'))) AND (wp_p2p.p2p_type = 'schedule_to_courses' AND wp_posts.ID = wp_p2p.p2p_from AND wp_p2p.p2p_to IN (SELECT wp_posts.ID FROM wp_posts WHERE 1=1 AND wp_posts.ID IN (20989) AND ((wp_posts.post_type = 'courses' AND (wp_posts.post_status = 'publish' OR wp_posts.post_status = 'acf-disabled'))) ORDER BY wp_posts.post_date DESC )) GROUP BY wp_posts.ID ORDER BY CAST(wp_postmeta.meta_value AS DATE) ASC