≡ Menu

Open Source

Continuing in the analyst program at IBM World of Watson we got an update on the evolution of IBM's Cloud Platform from Bill Karpovich. Cloud, he says, is strategic. Vendors must disrupt with cloud or be disrupted by it. Over the last few years, cloud has evolved from efficient public cloud infrastructure to new applications [...]

One of the announcements at IBM's World of Watson is of the new Watson Machine Learning Service. I got a chance to ask a few questions about this new capability. A couple of key elements emerged regarding the current platform and the immediate announcement. First, some context. The Watson Data Platform (also announced at World of [...]

Second keynote at IBM's World of Watson is on the role of cloud and data as the foundation for cognitive systems and businesses. Bob Picciano from IBM's analytics business kicked things off. Bob began with a discussion of how IT's value has changed - from a focus on processing and how fast/cheap it can be done to a [...]

I recently got a chance to catch up with the IBM SPSS team for an update. Analytics, in IBM’s view and mine, are increasingly necessary as digitization increases the scale of business data and digital disruptors increase the difficulty of making good decisions. For those being disrupted analytics offers a powerful way to fight back. [...]

Last session of the day is a freeform executive Q&A so I will just list bullet points as they come up: Open Source R is obviously a hot topic in analytics and SAS' focus on more open APIs that are broadly accessible and their renewed focus on academic partnerships are designed to "leave R in the [...]

I am giving a webinar on Predictive Analytics Deployment to Mainframe or Hadoop with Mike Zeller of Zementis on Thursday, March 3, 2016 11:00:00 AM PST The big challenge for analytics-driven organizations today is closing the gap between deriving an analytic result and getting the ROI. Organizations need a consistent and efficient way to deploy analytic results [...]

I have just finished updating Enterprise Scale Analytics with R with new data from the Rexer Analytics Survey for 2015. As R has become more popular, the role of analytics has become increasingly important to organizations of every size. Increasingly, the focus is on enterprise-scale analytics—using advanced, predictive analytics to improve every decision across the [...]

Final session today focused on systems and architecture for Big Data Analytics. It began by talking about the friction between business and IT and how this is increasing, especially around information and analytics where business users want to be able to work with data without worrying about IT. This creates challenges for IT specifically: The [...]

OneDecision, DMN and DecisionsFirst

Those of you who follow the news on the new Decision Model and Notation (DMN) standard will have seen that we have become a supporter of the new OneDecision.io open source initiative. OneDecision is working on making it easy to execute a well defined DMN decision table - essentially taking a DMN interchange file defined [...]

First Look: Dataiku 2.0

I last got an update from Dataiku in November of 2014. Since then they have raised money and opened an office in New York. New features and capabilities have been added to the product and they are seeing good interest in the product from US customers as they expand here. The 2.0 version has been [...]

Teradata Listener #TD3PI

Continuing at the Teradata Influencers event with a discussion of Teradata Listener. Teradata Listener provides real time streaming data support., intending to make it easy to allow multiple projects in an enterprise to listen to streaming data, capture it and make it available for later analysis. It's based on open source and software only. It [...]

Day two of the Teradata Influencers event started with Hermann Wimmer, co-president. Hermann gave us an introduction to the overall company strategy. Teradata has two co-presidents, one focused on data and analytics and one focused on marketing applications with some shared services. This allows  them to focus on the three markets there see for their [...]

I am attending the Teradata Influencers Summit in lovely Del Mar, California. First up is Oliver Ratzesberger, Senior VP of Software, to talk about recent technology innovations and technology strategy. Oliver highlighted some of the key trends and themes for Teradata. He began with a maturity model for "the sentiment enterprise" (video here): Agile Data [...]

RapidMiner 2015 Update

It’s been a while since I last got an update on RapidMiner (RapidMiner 6 was the last version I reviewed) and they have some new positioning and product capabilities I wanted to catch up on. RapidMiner began as an open source product company founded in 2007. They moved to Open Core in 2010 and now [...]

I recently did some research on the requirements for enterprise-scale analytics and the challenges of using open source R in this context. In my first post (Requirements for Enterprise Scale Analytics with R - Part 1) I outline some of the requirements I see for enterprise scale analytics. In this second post I will discuss the challenges of R [...]

I recently did some research on the requirements for enterprise-scale analytics and the challenges of using open source R in this context. In this first post I wanted to outline some of the requirements I see for enterprise scale analytics and in a second post I will discuss the challenges of R in that context. As advanced [...]

Webinar: Up Your R Game

I am giving a webinar Up Your R Game with Bill Franks of Teradata on July 29th at 1pm Eastern time: Open source R is no longer a fad or a tool used in the academic world. According to the Rexer survey,* R is the analytic software of choice for data scientists, business analysts, and data [...]

With the fourth post in this series I am going to talk about Hadoop - something with even more hype than R or predictive analytics. As we all know the era of Big Data has arrived. As anyone who reads the IT or business press knows, there is more data available today,  this data is no longer [...]

The third post in my series on standards in Predictive Analytics is on R, a hot topic in analytic circles these days. R is fundamentally an interpreted language for statistical computing and for the graphical display of results associated with these statistics. Highly extensible, it is available as free and open source software. The core environment [...]

I am working on a paper, for publication in early 2014, on the role of standards such as R, Hadoop and PMML in the mainstreaming of predictive analytics.  As I do so I will be publishing a few blog posts. I thought I would start with a quick introduction to the topic now and then [...]