Data Engineer, Industrial Placement/Graduate- Summer/Sept 2023 start
Vortexa was founded to solve the immense information gap that exists in the energy industry. By using massive amounts of new satellite data and pioneering work in artificial intelligence, Vortexa creates an unprecedented view on the global seaborne energy flows in real-time, bringing transparency and efficiency to the energy markets and society as a whole.
Processing thousands of rich data points per second from many vastly different external sources, moving terabytes of data while processing it in real-time, running complex prediction and forecasting AI models while coupling their output into a hybrid human-machine data refinement process and presenting the result through a nimble low-latency SaaS solution used by customers around the globe is no small feat of science and engineering. This processing requires models that can survive the scrutiny of industry experts, data analysts and traders, with the performance, stability, latency and agility a fast-moving startup influencing multi-$m transactions requires.
The Data Production Team is responsible for all of Vortexa’s data. It ranges from mixing raw satellite data from 600,000 vessels with rich but incomplete text data, to generating high-value forecasts such as the vessel destination, cargo onboard, ship-to-ship transfer detection, dark vessels, congestion, future prices, etc
The team has built a variety of procedural, statistical and machine learning models that enabled us to provide the most accurate and comprehensive view of energy flows. We take pride in applying cutting-edge research to real-world problems in a robust, long-lasting and maintainable way. The quality of our data is continuously benchmarked and assessed by experienced in-house market and data analysts to ensure the quality of our predictions.
You’ll be instrumental in designing and building infrastructure and applications to propel the design, deployment, and benchmarking of existing and new pipelines and ML models. Working with software and data engineers, data scientists and market analysts, you’ll help bridge the gap between scientific experiments and commercial products by ensuring 100% uptime and bulletproof fault-tolerance of every component of the team's data pipelines.
Area of Responsibilities