Tag Archives: machine learning

Riding the data: How a transportation company used data science to improve decision-making processes

How can a company leverage customer data and turn it into actionable information? This was the challenge one transportation provider faced when the modeling system it had implemented to predict revenue and passenger traffic began underperforming. In this article, Milliman consultant Antoine Ly discusses how the firm created a machine-learning model that helps the company analyze various aspects of its ridership, leading to more informed financial decisions.

Here is an excerpt:

Working from a mock-up drafted by the client, the [Milliman] team reproduced the dashboard to the client’s specifications, but it is now supported by newly developed software as well as the client’s data warehouse. The dashboard allows the client’s management team to query different aspects of passenger usage to gain insight into traffic flows and revenue. Colour-coded symbols, when clicked on, give managers a concise picture of a train’s revenue and traffic. Managers can also query the system based on selected features for both past usage and anticipated ridership, and are now able to make more informed decisions about pricing, the need for discounts, or adjustments to marketing campaigns.

Because the model can adapt to new situations, deviations from the average error are confined to a much narrower range. This gives managers more confidence in the model’s predictive value and increases their ability to manage revenue.

Emerging risk analytics: Application of advanced analytics to the understanding of emerging risk

This report by Milliman’s Neil Cantle uses advanced machine learning algorithms, such as deep neural networks, to analyse social media conversations about Brexit. The purpose of the study was to examine whether useful information could be extracted from social media in what is effectively real time on a key topic in a political economy.

Validating machine-learning models

While machine-learning techniques can improve business processes, predict future outcomes, and save money, they also increase modeling risk because of their complex and opaque features. In this article, Milliman’s Jonathan Glowacki and Martin Reichhoff discuss how model validation techniques can mitigate the potential pitfalls of machine-learning algorithms.

Here is an excerpt:

An independent model validation carried out by knowledgeable professionals can mitigate the risks associated with new modeling techniques. In spite of the novelty of machine-learning techniques, there are several methods to safeguard against overfitting and other modeling flaws. The most important requirement for model validation is for the team performing the validation to understand the algorithm. If the validator does not understand the theory and assumptions behind the model, they are unlikely to perform an effective validation of the modeling process. After demonstrating an understanding of the model theory, the following procedures are helpful in performing the validation.

Outcomes analysis refers to comparing modeled results to actual data. For advanced modeling techniques, outcomes analysis becomes a very simple yet useful approach to understanding model interactions and pitfalls. One way to understand model results is simply to plot actual and predicted outcomes, along with the number of observations, across the range of each independent variable. This allows the user to visualize the univariate relationship within the model and understand whether the model is overfitting to sparse data. To evaluate possible interactions, cross plots can also be created, looking at results in two dimensions rather than one. Dimensionality beyond two dimensions becomes difficult to evaluate, but looking at simple interactions does provide a useful initial understanding of how the model behaves with independent variables….
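To make the univariate plot described above concrete, here is a minimal sketch in Python. It assumes a pandas DataFrame with an independent variable, an actual outcome, and a model prediction; the column names and the binning choice are illustrative, not part of the article.

```python
import pandas as pd
import matplotlib.pyplot as plt

def univariate_outcomes_plot(df, feature, actual, predicted, bins=20):
    """Plot mean actual vs. predicted outcomes across the range of one
    independent variable, with observation counts to flag sparse regions."""
    binned = pd.cut(df[feature], bins=bins)
    summary = df.groupby(binned, observed=True).agg(
        actual_mean=(actual, "mean"),
        predicted_mean=(predicted, "mean"),
        n_obs=(actual, "size"),
    )
    midpoints = [interval.mid for interval in summary.index]

    fig, ax_count = plt.subplots()
    # Bars show how much data supports each region of the feature's range.
    ax_count.bar(midpoints, summary["n_obs"],
                 width=(df[feature].max() - df[feature].min()) / bins,
                 alpha=0.3)
    ax_count.set_xlabel(feature)
    ax_count.set_ylabel("number of observations")

    # Lines compare actual and predicted outcomes on a second axis.
    ax_outcome = ax_count.twinx()
    ax_outcome.plot(midpoints, summary["actual_mean"], marker="o", label="actual")
    ax_outcome.plot(midpoints, summary["predicted_mean"], marker="x", label="predicted")
    ax_outcome.set_ylabel("outcome")
    ax_outcome.legend(loc="upper right")
    plt.show()

# Hypothetical usage:
# univariate_outcomes_plot(df, "policy_age", "actual_loss", "predicted_loss")
```

Where actual and predicted lines diverge in regions with few observations, the model may be overfitting to sparse data; the same idea extends to two-dimensional cross plots for simple interactions.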

…Cross-validation is a common strategy to help ensure that a model isn’t overfitting the sample data it is being developed with. Cross-validation has been used to help ensure the integrity of other statistical methods in the past, and with the rising popularity of machine-learning techniques, it has become even more important. In cross-validation, a model is fitted using only a portion of the sample data. The model is then applied to the other portion of the data to test performance. Ideally, a model will perform equally well on both portions of the data. If it doesn’t, it’s likely that the model has been overfit.
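A minimal sketch of the hold-out and cross-validation check described above, using scikit-learn; the synthetic data and the choice of a gradient-boosting model are illustrative assumptions, not the techniques discussed in the article.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split, cross_val_score

# Illustrative data; in practice this would be the modeler's own sample data.
X, y = make_regression(n_samples=2000, n_features=10, noise=10.0, random_state=0)

# Fit on one portion of the sample and test on the held-out portion.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)
model = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)
print("train R^2:", model.score(X_train, y_train))
print("test  R^2:", model.score(X_test, y_test))
# A large gap between the two scores suggests the model is overfitting.

# k-fold cross-validation repeats the split several times for a more stable check.
scores = cross_val_score(GradientBoostingRegressor(random_state=0), X, y, cv=5)
print("5-fold R^2 scores:", scores)
```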

The Data Science Game is back

Registration for the 2017 Data Science Game is officially open. The Data Science Game is a two-phase competition showcasing teams of data science students from universities around the world. An online qualifier will take place on April 15 with the final stage happening in September.

Students from the Moscow Institute of Physics and Technology (MIPT) won last year’s competition. Will your university win this year? To register your team, visit www.datasciencegame.com. The deadline to register is April 9.

Milliman is a sponsor of the 2017 Data Science Game.

Milliman Pixel

Milliman’s Pixel is a web-based, competitive analytics platform that helps insurers use objective and comprehensive information to grow their business.

In this video, Milliman actuaries Nancy Watkins, Peggy Brinkman, and Cody Webb discuss how Pixel helps insurers compare their premiums with those of competitors, identify market sectors where they might be experiencing adverse selection, and access competitive information needed to make sound pricing decisions.

To learn more about Pixel, click here.

The Data Science Game’s qualification round is complete

In July, teams of data science students from more than 50 universities around the globe competed in the qualification phase of the 2016 Data Science Game. Over 140 teams of four students were asked to develop an algorithm that could recognize the orientation of a roof from a satellite photograph by building on more than 10,000 photographs of roofs categorized through crowdsourcing.

Twenty-two teams have qualified for the final phase. The three top-ranked teams were Jonquille (University Pierre and Marie Curie), PolytechNique (Ecole Polytechnique), and The Nerd Herd (University of Amsterdam). The final is being held in Paris on September 10 and 11, where the teams will compete in a big data analysis challenge.

Data Science Game finalists

For more information on the Data Science Game, click here.

Milliman is a sponsor of the 2016 Data Science Game.