≡ Menu

hadoop

I am attending the Teradata Influencers Summit in lovely Del Mar, California. First up is Oliver Ratzesberger, Senior VP of Software, to talk about recent technology innovations and technology strategy. Oliver highlighted some of the key trends and themes for Teradata. He began with a maturity model for “the sentiment enterprise” (video here): Agile Data [...]

As part of my ongoing series of interviews with analytic practitioners I caught up recently with Matthew Kitching, Senior Data Scientist at Apption. I am giving a webinar with Matt Kitching, Senior Data Scientist at Apption at 10am Pacific, May 27 on The Value of Predictive Analytics and How Using Decision Modeling Helps You Succeed. Apption is [...]

RapidMiner 2015 Update

It’s been a while since I last got an update on RapidMiner (RapidMiner 6 was the last version I reviewed) and they have some new positioning and product capabilities I wanted to catch up on. RapidMiner began as an open source product company founded in 2007. They moved to Open Core in 2010 and now [...]

Dataiku, a company founded in 2013 and based in France, launched their product, Data Science Studio (DSS), in February 2014. DSS is a web-based analytic software platform designed for data scientists and analysts. The product is designed to improve the effectiveness and productivity of data teams especially when it comes to turning raw data into [...]

Gregory over at KDNuggets had an interesting post with some Top Analytics and Big Data trends ahead of Strata Hadoop NYC Conference based on input from their readers. Three trends struck me: The challenge of communicating complex analyses to non-technical clients/partners We are having increasing success using logical decision models to show how data and analytics drive better [...]

Actian has recently released its new Actian Analytics Platform (blogged about here) to deliver scale, parallelism, advanced analytics etc. Following on from this they announced a new offering, Clear Path. The Clear Path Program is designed to provide blueprints to help customers adopt this platform and deliver “transformational value”. Clear Path Analytics Blueprints package up [...]

First Look: Teradata Aster R

While R has become very popular in recent years the fact remains that as an open source product it has some scalability and performance issues (discussed in our paper on Standards in Predictive Analytics for instance). Base open source R is not really designed for the kind of large data volumes, Big Data, that are [...]

My final session of the event is George Mathew’s General Session on the product and its roadmap. George recapped Alteryx’s laser focus on data analysts – who make up a big chunk of the audience here – and their focus on making the next best decision. It’s great to see the focus on decision-making not just [...]

TIBCO Spotfire 6.5 (announced last month) has new capabilities around easier access to data, location analytics and R. To support effective data discovery and visualization, TIBCO believes it is essential to allow users easy access to analytics against the increasingly wide range of data sources that are available. It’s not enough that the visualization tools [...]

Zementis, the company behind the ADAPA PMML engine (last reviewed here and a sponsor of our Standards in Predictive Analytics research), has been busy making their technology for deploying models based on the Predictive Model Markup Language or PMML more pervasive. They have launched on the Amazon AWS Marketplace. While they have supported AWS for [...]

SAS® Model Manager is getting an update soon to release 13.1 (I last blogged about Model Manager 3.1). The vision of SAS Model Manager going forward is to streamline the integration of predictive modeling into the overall environment, make it easier to operationalize analytical models, expand the model portfolio management capabilities and improve governance and [...]

SAS® Enterprise Miner recently got a major release – 13.1 – focused on machine learning, scalability and productivity. It’s been a while since I blogged about SAS Enterprise Miner (last review here) so this might not be a complete list of the improvements since then. The machine learning focus added High Performance Support Vector Machines [...]

SAS is upgrading its in-memory analytics products with SAS® Visual Statistics (forthcoming) and SAS® In-Memory Statistics for Hadoop. SAS In-Memory Statistics for Hadoop is available now and SAS Visual Statistics is going to be shipping in July of 2014. SAS Visual Statistics is based on the SAS® LASR™ Analytic Server for in-memory processing and is [...]

First Look: SQLstream 4.0

I last wrote about SQLstream back in 2012 and got an update from them recently. Recent news includes a partnership with Oracle and new performance benchmarks (against Hadoop Storm for instance), and their latest SQLstream 4.0 release. New 4.0 features include Performance optimization of the streaming integration layer and the UDX mechanism for SQL extensions [...]

I first got a briefing on Dulles Research Carolina back in 2010 and I recently got an update. Dulles Research was founded in 2005 and came to market with Carolina, their SAS to Java convertor, in 2009. A U.S. patent was awarded to Dulles last year for the SAS to Java conversion process. At its [...]

I got an update from Actian about their new announcement recently. Actian recently acquired ParAccel (reviewed here) and Pervasive (see my reviews of RushAnalyzer and DataRush) and for 2014 they are bringing all this IP together into a new Actian Analytic Platform. Actian is now a $140m company with strong profits and 10,000+ customers including [...]

First Look: Rapid Insight Update

I last got an update from Rapid Insight back in March of 2012. Since then they have made a number of updates to the products. To recap, Rapid Insight Veera is a data intelligence tool for creating analytical processes and modeling datasets (supporting an extract, transform, analyze metaphor) and Rapid Insight Analytics is a data [...]

To wrap up the series I have been writing on standards in predictive analytics, here’s the report I have been working on. This report discusses each of the topics in the series – R, Hadoop and PMML – in more detail and pulls it all together in a single paper. You can get the Standards in Predictive [...]

In this series so far we have discussed a number of standards – R, PMML and Hadoop – that are well established. There are also some future developments that are worth considering—the emergence of the Decision Model and Notation standard, growing acceptance of Hadoop 2 and planned updates to PMML specifically. As regular readers of [...]

With the fourth post in this series I am going to talk about Hadoop – something with even more hype than R or predictive analytics. As we all know the era of Big Data has arrived. As anyone who reads the IT or business press knows, there is more data available today,  this data is no longer [...]