I last got an update from Dataiku in November of 2014. Since then they have raised money and opened an office in New York. They have added new features and capabilities to the product and are seeing good interest from US customers as they expand here. The 2.0 version has been [...]
mapreduce
While R has become very popular in recent years, the fact remains that, as an open source product, it has some scalability and performance issues (discussed in our paper on Standards in Predictive Analytics for instance). Base open source R is not really designed for the kind of large data volumes, Big Data, that are [...]
My final session of the event is George Mathew’s General Session on the product and its roadmap. George recapped Alteryx’s laser focus on data analysts – who make up a big chunk of the audience here – and their focus on making the next best decision. It’s great to see the focus on decision-making not just [...]
Marie Wieck came back to discuss how to create the kind of composable business IBM discussed on day 1. It’s critical, she says, to keep customers at the center and drive a new process from this. It’s key to use data to make better decisions, and it’s important to have the kind of scalable infrastructure [...]
I first got a briefing on Dulles Research Carolina back in 2010 and I recently got an update. Dulles Research was founded in 2005 and came to market with Carolina, their SAS to Java converter, in 2009. A U.S. patent was awarded to Dulles last year for the SAS to Java conversion process. At its [...]
I last got an update from Rapid Insight back in March of 2012. Since then they have made a number of updates to the products. To recap, Rapid Insight Veera is a data intelligence tool for creating analytical processes and modeling datasets (supporting an extract, transform, analyze metaphor) and Rapid Insight Analytics is a data [...]
In this series so far we have discussed a number of standards – R, PMML and Hadoop – that are well established. There are also some future developments that are worth considering—the emergence of the Decision Model and Notation standard, growing acceptance of Hadoop 2 and planned updates to PMML specifically. As regular readers of [...]
With the fourth post in this series I am going to talk about Hadoop – something with even more hype than R or predictive analytics. As we all know, the era of Big Data has arrived. As anyone who reads the IT or business press knows, there is more data available today, this data is no longer [...]
I am working on a paper, for publication in early 2014, on the role of standards such as R, Hadoop and PMML in the mainstreaming of predictive analytics. As I do so I will be publishing a few blog posts. I thought I would start with a quick introduction to the topic now and then [...]
FICO has just released Model Builder 7.4, the first “Big Data” release of their analytic modeling tool (I reviewed 7.2 here and the 7.4 press release is here). Four things make this release: text analytics and semantic scorecards; high-volume processing with Hadoop, including algorithms moved to MapReduce; R integration; and, drifting away from Big [...]
As part of some ongoing research on support for PMML I recently spoke with Concurrent. Concurrent is an enterprise software company focused on simplifying Big Data development on Hadoop. The company’s core product is called Cascading. This is a free, open-source, development framework for Apache Hadoop designed to let developers build sophisticated data processing applications [...]
Oracle Advanced Analytics is a new Oracle database option (announced today) that bundles Oracle R Enterprise and Oracle Data Mining (reviewed previously). With this release, R becomes a first class native interface for the Oracle database along with SQL and the graphic interface that ships with Oracle Data Mining. This allows analytic modeling code to [...]
Ideate is an Application Framework from a company called Consilience International that was started about 2 years ago by a couple of process folks looking to do something that was more suitable for highly evolvable, dynamic environments where runtime adaption was important. These kinds of dynamic applications are increasingly a focus for companies. Consilience’s view [...]
Teradata announced its intent to acquire Aster Data today. Obviously this is big news in analytics-land and I participated in a call where the two companies gave some quick information. The driver for the acquisition seems to be an increasing focus on generally unstructured and untapped data and expanding the Teradata portfolio into this adjacent [...]
Mike Hoskins came on to wrap up the event after handing out some awards. His focus was on integration horizons, of which he identified three. Integration Management: This has not historically been a focus for Pervasive – they have been focused on the engines and technical details. The new stack, version 10, has clearly been [...]
Jim Falgout of Pervasive DataRush presented on best practices for custom big data applications. I have spoken to Jim before, when I reviewed DataRush. Jim sees the big data challenges as being driven by both complexity (high performance computing problems like fluid dynamics, climate modeling) and data size (internet scale data for web indexing and [...]
Syndicated from SmartDataCollective. Quick update from Teradata to kick off the day, focused on Active Enterprise Intelligence. This remains a key theme for Teradata, unsurprising given the focus of Teradata customers on an enterprise data warehouse full of operational data. AEI is about a focus on moving from the back office to the front office [...]
in2clouds is focused on helping companies use Predictive Analytics to improve their business performance. Founded by MicroStrategy alumni and launched in 2009, in2clouds is a small company that has been working in hi-tech, financial services and retail. Seeing analytics as “the next big thing” they want to reduce the friction for mainstream adoption and help [...]
I listened in to the Boulder BI Brain Trust briefing from Aster some weeks back and then got a follow-on update last week. Aster was founded in 2005 based on research performed by a team at Stanford. The initial plan was to develop a data management platform based on commodity hardware and this was released [...]
I got my first chance to catch up with the folks at Cloudera recently. Founded in 2008, Cloudera has nothing really to do with “cloud” and focuses instead on “big data” – helping organizations capture, integrate and analyze new sources of detailed business data. Cloudera like to describe themselves as the RedHat for Hadoop – [...]