Posts

Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 4 Nov 8, 2018 • Tim Bohn and Ricardo Balduino

In Part 1 of this series, we wrote about our goal to explore a use case and use various machine learning platforms to see how we might build classification models with those platforms to predict flight cancellations. In Part 2, we started our exploration with IBM SPSS Modeler and APIs from The Weather Company. Part 3 looked at doing this with Watson Studio in a python Jupyter notebook. In Part 4 we look at using the new SPSS Modeler flows in Watson Studio.
Football Fan Sentiment Oct 22, 2018 • Ricardo Balduino

In August, students from Yale's Center for Customer Insights presented their findings from a study that used various IBM Analytics tools to analyze social media data from NFL fans. This data was used to gauge fans’ perceptions of the league and popular topics surrounding it and to then use these insights to drive effective social media interactions.
Leveraging Watson’s Machine Learning GPUs to accelerate your Deep Learning project in Python Aug 8, 2018 • Maxime Allard

Use GPUs from the Watson Machine Learning Service for any deep learning project in Python
Statistical Modeling and Machine Learning Applications for Time-Series Problems Jul 31, 2018 • Aakanksha Joshi

Different ways to work with time series use cases
Reinforcement Learning: The Business use-cases Part 2 Jul 26, 2018 • Aishwarya Srinivasan

Reinforcement Learning in Trading
Reinforcement Learning: The Business use-cases Part 1 Jul 24, 2018 • Aishwarya Srinivasan

First blog of the Reinfocement Learning: The Business Use-cases series; Introduction to Reinforcement Learning
Predictive Maintenance Scheduling with IBM Data Science Experience and Decision Optimization Jun 20, 2018 • Yana Ageeva

How to keep your assets healthy and repair costs low by scheduling maintenance at exactly the right time
DSX, GitHub and forking Jun 18, 2018 • Victor Terpstra

Forking a DSX project in GitHub creates a greater separation between users and allows for a more flexibility in merging assets. A greater separation allows DSX users to commit to their linked GitHub repository frequently without immediately impacting other users of the project. Assets are merged between users explicitly using GitHub pull-requests and can only be done when and if the owner(s) of the repository agree. Solving merge conflicts can be performed outside of DSX, either on GitHub or using any of the available Git tools on your local workstation.
Using Machine Learning to Predict Outcomes for Sepsis Patients Apr 30, 2018 • Ricardo Balduino

Sepsis is a complex disease that is difficult to identify and treat. IBM and Geisinger Healthcare System partnered to build a model to predict all-cause mortality of Sepsis patients, which could guide medical teams to closely monitor and take preventive measures for those patients with high probability of death.
GitHub for DSX projects Apr 14, 2018 • Victor Terpstra

Linking a DSX project to a GitHub repository has a number of advantages: backup of your code and assets, easy migration to new DSX clusters, working on multiple clusters simultaneously, working off-line on your local work station, sharing your code, collaboration, etc. This post contains step-by-step instructions of the setup and work flow for the GitHub integration in DSX.
The Journey to Digital: Part 3, Insight Transformation Apr 12, 2018 • Seth Dobrin

What needs to be done differently at this stage? This is about mapping out the decisions people make across the enterprise. You’ll use this full map of those decisions to create a backlog of decisions for your data science teams to tackle. Then to prioritize the backlog, you’ll assign a value to each decision, taking into account the likelihood and ease of implementation.
Using Excel to manage scenarios in Decision Optimization for DSX Apr 4, 2018 • Victor Terpstra

Excel can be used to manage scenario data consisting of multiple tabular data sets in one file. This allows for an easier creation of scenarios for what-if analysis using the Decision Optimization add-on for DSX.
Don’t Let Data Science Become a Scam Mar 15, 2018 • Seth Dobrin

Companies have been sold on the alchemy of data science. They have been promised transformative results. They modeled their expectations after their favorite digital-born companies. They have piled a ton of money into hiring expensive data scientists and ML engineers. They invested heavily in software and hardware. They spend considerable time ideating. Yet despite all this effort and money, many of these companies are enjoying little to no meaningful benefit. This is primarily because they have spent all these resources on too much experimentation, projects with no clear business purpose, and activity that doesn’t align with organizational priorities.
The Journey to Digital: Part 2, Data Transformation Mar 15, 2018 • Seth Dobrin

This stage is about defining the core assets that create value for the enterprise, and it’s about identifying, discovering and governing the right data without necessarily expecting — or forcing — upheaval. At this stage, data consumers might ask for data to support their preconceived notions. That’s actually fine. More often than not, aligning with existing expectations is a necessary step as you build consensus for data science to transform the organization.
The Journey to Digital: Part 1, Table Stakes Feb 14, 2018 • Seth Dobrin

Effectively transforming a company requires a commitment to do things differently, and requires that your partners and vendors do things differently too. To my mind, these days transforming a company means transforming it digitally and any successful digital transformation requires three things. I think of these as the minimum requirements for getting a seat at the table, the table stakes if you will:
Raiders of Every Industry: Journey to Digital Feb 1, 2018 • Seth Dobrin

Every industry will be disrupted in the coming years. None are safe. The safer they seem, the more susceptible they probably are. In fact, only two things stand a chance of protecting incumbents:
Hybrid Use Cases Dominate Machine Learning - Part 2 Jan 25, 2018 • John Thomas

[Part 2] The importance of information architecture in meeting enterprise AI needs, as well as how ML can be applied in hybrid scenarios
Hybrid Use Cases Dominate Machine Learning - Part 1 Jan 16, 2018 • John Thomas

[Part 1] The importance of information architecture in meeting enterprise AI needs, as well as how ML can be applied in hybrid scenarios
How IBM builds an effective data science team Dec 22, 2017 • Seth Dobrin

Data Science is a team sport. While I’m not sure where this phrase was coined, it is an accurate phrase. This must be ringing true with enterprises as well since we often get the question: “What should the structure of a data science team be?” and “Where should a data science team report into?” The first question is probably a little more straight forward and that is what we will address here. The answer to the second question really is it depends on your organizations maturity in this space, but is should be some sort of federated or hub and spoke model eventually. Breaking down what is required to successfully execute a data science project and acknowledging that very few individuals possess all these skills helps to define what a team should look like.
3 Scenarios for Machine Learning on Multicloud Dec 20, 2017 • John Thomas

More and more cloud-computing experts are talking about “multicloud”. The term refers to an architecture that spans multiple cloud environments in order to take advantage of different services, different levels of performance, security, or redundancy, or even different cloud vendors. But what sometimes gets lost in these discussions is that multicloud is not always public cloud. In fact, it’s often a combination of private and public clouds. As machine learning (ML) continues to pervade enterprise environments, we need to understand how to make ML practical on multicloud — including those architectures that span the firewall. Let’s look at three possible scenarios.
Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 3 Dec 13, 2017 • Tim Bohn and Ricardo Balduino

In Part 1 of this series, we wrote about our goal to explore a use case and use various machine learning platforms to see how we might build classification models with those platforms to predict flight cancellations. Specifically, we hoped to predict the probability of the cancellation of flights between the ten U.S. airports most affected by weather. We used historical flight data and historical weather data to make predictions for upcoming flights. In Part 2, we started our exploration with IBM SPSS Modeler and APIs from The Weather Company. With this post, we look at IBM’s Data Science Experience (DSX).
What IBM looks for in a data scientist Nov 30, 2017 • Seth Dobrin & Jean-François Puget

Job seekers sometimes ask how IBM defines “data scientist.” It’s an important question since more and more would-be data scientists are fighting for attention in an increasingly lucrative labor market.
So what exactly is IBM doing different with machine learning? Nov 1, 2017 • John Thomas

Machine Learning itself is not new. The concepts have been around for decades, and many companies have been building ML models and doing predictive analytics for a while. So what exactly is IBM doing in this space?
Tools for data science: Using the right ones for the job Oct 9, 2017 • Tim Bohn

You’ve probably heard the sayings many times: “You’re only as good as your tools.” and “Use the right tool for the right job.” These have never been truer than in the world of data science and machine learning.
Exploring insights on sepsis from medical literature Sep 18, 2017 • Vinay Rao Dandin & Avijit Chatterjee

Today, many industries are using machine learning to empower their businesses and solve unique challenges. And it’s not just businesses. The medical field is using machine learning to solve problems for the general betterment of humanity. Let’s walk through a medical use case on an infectious disease, sepsis, where we’ll use an unsupervised learning technique to discover interesting insights.
Text Analytics: Finding Insights in Scientific Publications Sep 11, 2017 • Ricardo Balduino

Can you read fast? Really, really fast? As you start reading this blog, consider that a person reads an average page of text in about 2 minutes, so it could take you about 10 minutes to read this whole post, more or less. Now imagine reading 10 to 20 pages of a scientific paper. Next imagine reading hundreds, thousands, or even millions of such papers. Not an easy — if even feasible — task for a single person or even for a group of avid readers. And even if a group of people can read many scientific publications in a reasonable time, how would they then combine their acquired knowledge and establish correlations between articles and terms of interest, find common patterns related to a specific subject, and so on?
Hello, Watson! A Bot-Building Tutorial Aug 27, 2017 • David Thomason

Whether we are part of a large business, a small startup, or a non-profit organization, there’s one objective that tends to arise when we seek to design and improve our organizations: providing our customers and users with an engaging experience. Although there are many ways to approach this objective, one of the recent trends to accomplish this is the rise of chatbots.
Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 2 Aug 14, 2017 • Tim Bohn & Ricardo Balduino

As we described in Part 1 of this series, our objective is to help predict the probability of the cancellation of a flight between two of the ten U.S. airports most affected by weather conditions. We use historical flights data and historical weather data to make predictions for upcoming flights.
Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 1 Aug 7, 2017 • Tim Bohn & Ricardo Balduino

This is the first in a series of blog posts where we’ll explore a use case and a few different machine learning platforms to see how we might build a model, using platforms that can help predict flight cancellations. In part one, we’ll talk about the use case, how and why we limited the scenario, and about the data we gathered to start the data science / machine learning process.
Ingest and Analyze Streaming Event Data at Scale with IBM EventStore Jul 31, 2017 • Siva Anne

In the connected world of today’s digital economy, apps, IoT devices, vehicles, appliances and servers are generating endless stream of event data. Event data is typically in small size per event but streams in at high speed generating huge volumes over time. The stream of events describes what is happening over time and offers the opportunity to track and analyze things as they happen. With access to complete sets of event data, businesses can derive critical insights from comprehensive analysis on all the data.
Walk Through a Data Scientist’s Experience with a Retail Use Case Jul 17, 2017 • Avijit Chatterjee

Data Scientists are compared to Unicorns, a rare and mythical creature with great powers. While one can find folks with machine learning skills, it’s extremely rare to find one who’s also a domain expert with a tremendous understanding of the business. The business landscape is changing fast today. One example is retail, an industry that is seeing tremendous disruption due to the shift from brick-and-mortar to online buying. Machine learning is heavily used in this industry to augment the customer buying experience with individualized just-in-time cross-sell and up-sell offers. Let’s walk through a retail use case. We’ll start by identifying the business problem and then we’ll use data science to find a solution.
Machine Learning & Apache Spark: A Dynamic Duo Jun 12, 2017 • John Thomas

The Machine Learning revolution is underway and is changing industries and delivering outcomes that were unimaginable a few years ago. In this video of John J. Thomas’s keynote at ApacheCon on May 17, 2017, learn how Apache Spark and other related projects are being used by innovative companies to remake products and services and enabling data-driven decision making.
Six Steps Up: From Zero to Data Science for the Enterprise Mar 19, 2017 • Seth Dobrin

Data has intrinsic value to the enterprise, but how to quantify these data assets has been a struggle for many organizations and for many enterprises as they establish modern data practices and data organizations. In most organizations, data in and of itself doesn’t have intrinsic value. The data’s value emerges only after we build platforms for data science.

Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 4 Nov 8, 2018 • Tim Bohn and Ricardo Balduino

Football Fan Sentiment Oct 22, 2018 • Ricardo Balduino

Leveraging Watson’s Machine Learning GPUs to accelerate your Deep Learning project in Python Aug 8, 2018 • Maxime Allard

Statistical Modeling and Machine Learning Applications for Time-Series Problems Jul 31, 2018 • Aakanksha Joshi

Reinforcement Learning: The Business use-cases Part 2 Jul 26, 2018 • Aishwarya Srinivasan

Reinforcement Learning: The Business use-cases Part 1 Jul 24, 2018 • Aishwarya Srinivasan

Predictive Maintenance Scheduling with IBM Data Science Experience and Decision Optimization Jun 20, 2018 • Yana Ageeva

DSX, GitHub and forking Jun 18, 2018 • Victor Terpstra

Using Machine Learning to Predict Outcomes for Sepsis Patients Apr 30, 2018 • Ricardo Balduino

GitHub for DSX projects Apr 14, 2018 • Victor Terpstra

The Journey to Digital: Part 3, Insight Transformation Apr 12, 2018 • Seth Dobrin

Using Excel to manage scenarios in Decision Optimization for DSX Apr 4, 2018 • Victor Terpstra

Don’t Let Data Science Become a Scam Mar 15, 2018 • Seth Dobrin

The Journey to Digital: Part 2, Data Transformation Mar 15, 2018 • Seth Dobrin

The Journey to Digital: Part 1, Table Stakes Feb 14, 2018 • Seth Dobrin

Raiders of Every Industry: Journey to Digital Feb 1, 2018 • Seth Dobrin

Hybrid Use Cases Dominate Machine Learning - Part 2 Jan 25, 2018 • John Thomas

Hybrid Use Cases Dominate Machine Learning - Part 1 Jan 16, 2018 • John Thomas

How IBM builds an effective data science team Dec 22, 2017 • Seth Dobrin

3 Scenarios for Machine Learning on Multicloud Dec 20, 2017 • John Thomas

Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 3 Dec 13, 2017 • Tim Bohn and Ricardo Balduino

What IBM looks for in a data scientist Nov 30, 2017 • Seth Dobrin & Jean-François Puget

So what exactly is IBM doing different with machine learning? Nov 1, 2017 • John Thomas

Tools for data science: Using the right ones for the job Oct 9, 2017 • Tim Bohn

Exploring insights on sepsis from medical literature Sep 18, 2017 • Vinay Rao Dandin & Avijit Chatterjee

Text Analytics: Finding Insights in Scientific Publications Sep 11, 2017 • Ricardo Balduino

Hello, Watson! A Bot-Building Tutorial Aug 27, 2017 • David Thomason

Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 2 Aug 14, 2017 • Tim Bohn & Ricardo Balduino

Unfriendly Skies: Predicting Flight Cancellations Using Weather Data, Part 1 Aug 7, 2017 • Tim Bohn & Ricardo Balduino

Ingest and Analyze Streaming Event Data at Scale with IBM EventStore Jul 31, 2017 • Siva Anne

Walk Through a Data Scientist’s Experience with a Retail Use Case Jul 17, 2017 • Avijit Chatterjee

Machine Learning & Apache Spark: A Dynamic Duo Jun 12, 2017 • John Thomas

Six Steps Up: From Zero to Data Science for the Enterprise Mar 19, 2017 • Seth Dobrin