Calendar heatmap – “Growing a company”

 

Data visualization is worth a thousands words.

1.) 2008, 10+ engineers in a small development company doing Hadoop projects.

2.) 2009, November we founded Datameer and raised series A.

3.) 2010, developing the product, private beta, launching Datameer version 1.0 in winter.

4.) 2011, selling the product and doubling the company in less than 6 months.

5.) 2012, focus on execution and growth of the company with regular department meetings.

Posted in Company Culture | Leave a comment | RSS

Welcome, Oracle. Seriously.

.

Welcome to the most exciting and important enterprise software market in the last three decades.

And congratulations on your first Hadoop appliance. Despite your original skepticism, putting real computing power in the hands of everybody is already improving the way people store, process and analyze data.  This will have a major impact on our society.

When we built the first Hadoop systems, we estimated that hundred of thousands of business would justify the investment in Hadoop once they understood the benefits.

Next year alone, we project that over 10000 new businesses will come to that understanding. Over the next decade, the growth of big data and Hadoop will continue in logarithmic leaps.  Hadoop literacy is already becoming as fundamental a skill for computer engineers and data analysts as Java or SQL.

We look forward to responsible competition in efforts to distribute this a global open source technology to the world. And we appreciate the magnitude of your commitment.

Because what we doing is increasing social capital by enhancing organizational and business capabilities to gain new insights and better understand the world.

Welcome to the task.
The Hadoop Community.

:)

P.S. Here is the original.

Posted in Big Data Analytics Perspectives | Leave a comment | RSS

Will you beat Santa to Christmas morning?

It’s that time of the year again when travelers drag themselves to the airport to get on a plane and hopefully meet their loved ones at their destination in a reasonable amount of time.

Thanks to publicly available FAA data, Datameer ran some analytics and found these interesting insights from past Christmas holidays. The data comes to us from the Research and Innovative Technology Administration (RITA) and has detail on everything from individual airports to the specific reasons why delays might happen.

Using Datameer, we generated a spline graph of the average number of incidents over an 8-year period. If you’re traveling on the 21st or 22nd (and to no one’s surprise), anticipate a sea of people at your airport, and possibly delays – these are the peak travel days. Surprisingly, if you’re traveling on the 24th or 25th, most people will have already completed their travels and your journey might be a little less stressful.

So what does this look like when we look at individual airports and destinations?  Using a circular graph chart, we can see the most delayed routes by the frequency of delays below. We set this up in Datameer by first mapping out the travel routes in the worksheet, grouping them and sorting by the highest count in descending order.

We can see from the diagram and illustrated by the thickness of the line above, routes between Chicago O’Hare, Newark, and LaGuardia were most heavily delayed. We know, of course, that Chicago is the largest hub in the US so there’s no surprise there. But what IS surprising is that if we look at the top number of incidents due to weather, Dallas took the top spot in number of incidents in December beating out Chicago and Atlanta, as shown in the diagram below.  So if you’re flying from or through Dallas, you might want to pack some extra reading materials.

And if you’re curious as to which travelers have spent the most time waiting in December, San Francisco travelers get that unfortunate title. Below we’ve taken the total number of minutes spent waiting to arrive at a particular airport by summing up the delayed-minutes field by destination.

Comparing this graph to the one above, we could deduce that SFO has lengthier delays and DFW has shorter but more frequent ones and the data confirms that this is the case.

So, while we don’t expect you can alter your travel plans, we hope you find these insights informative.  There is a wealth of publicly available data like this FAA dataset that can provide valuable insights.  And, Datameer offers a free trial of our solution here that you can use to analyze and explore this and virtually any other data.

Safe travels everyone. Happy Holidays from Datameer.

Posted in Big Data Analytics Perspectives | Tagged , , , | Leave a comment | RSS

Well hydrated

 

We at Datameer strongly believe that some of the most important answers to mankind’s challenges are hidden in data. We hope we can contribute to the research efforts of science and academia by building and making available an amazingly simple, yet powerful data analytics platform that enables analysts and scientists to focus on their research without having technology get in the way.

We also believe that lessening our own human impact is key to our future and requires many small steps. So here at Datameer, along with the obvious recycling, we have embraced a culture of low impact, sustainability and going green whenever possible.  To start, we installed energy saving light bulbs in our new offices and banned all Styrofoam containers from our daily lunch deliveries.

One more impactful change we recently made is to stop buying soda and water in plastic bottles. Instead, we provided all staff with beautiful stainless steel water bottles and added filters to our tap water. We used to generate hundreds of empty soda cans and plastic bottles each month, but since we made this minor change, we have been able to cut this waste down dramatically.

 
We’ve also purchased a great soda stream machine to enable us to invent our own organic house made sodas. Turns out, this not only had a green impact but also reduced our spending. On average, we used to spend more than $2 a day per person to supply bottled water and sodas. But after less than a month, the money saved already paid for the stainless steel water bottles and the soda stream machine.

With these simple changes, we will save more than $10,000 a year just in our San Mateo office alone.  More importantly, we are not creating trash in the environment and filling up our landfills with thousands of plastic bottles.

So, go and buy some stainless steel water bottles for your company!

Posted in Company Culture | Tagged , , | Leave a comment | RSS

The Changing Data Landscape, Ready or Not, I’m in for the Ride

 

Post from Dianna Doan, Senior Manager, Marketing Programs

The world is changing, whether we’re ready for it or not.  I spent a number of years in the world of traditional BI with Actuate so I understand the BI space. But few could and would predict the changes that have taken place in the last few years.

What’s causing this change? Its really driven from the likes of Facebook, Twitter and smart phones, the rise unstructured data. This is redefining the world in which we live in. As Gen Y’s begin to take over the social blogosphere, we will see the changes multiply. Retail chains, banks, government, all are plugging into the world we know as ‘big data’.  I joined Datameer recently to dive into this new big data world and the gold mine created for researchers, business analysts and marketeers wanting to make sense of it all.

As I begin to understand the intricacies of what this big data ecosystem is really all about, from Hadoop, to the Hadoop distributors, to Datameer, I realize big data has opened us up to a world of very interesting and often mind blowing analysis and insight.

For example, I started reading my friend Jonathan Chang’s thesis “Understanding, Uncovering and Predicting Links” to understand more about machine learning to prepare me for the job here at Datameer.  He’s a Data Scientist at Facebook, so naturally, his thesis seemed like an interesting extension to what Datameer does, (BI platform on top of Hadoop for big data).

While reading, I stumbled upon some really cool insights that got me excited about what the future holds for the players in this space. Big data is a beast.  It’s wide, it’s expansive and will intersect all of us at some point in our lives over the next few months, if not days.
Allow me to share snippets of Jonathan’s thesis.  Below is a simple, but insightful pictogram of how people are related, made possible by Hadoop.

Figure 1.1 shows what a subset of an online social network might look like. The nodes in the graph represent people and the edges represent self-reported friendship between members. Even in this simple example, a rich structure emerges with some individuals belonging to tightly connected clusters while others exist on the periphery. Characterizing this structure has been one major thrust of network research (Newmanet al. 2006b).

Going beyond the ‘social networks’, and without getting too technical, companies can now see how one person is related to another by building predictive and probabilistic models leveraging software companies like Zementis and their predictive modeling plugins to Datameer to understand and predict consumer buying behavior.

I then started reading earlier posts from my colleagues here at Datameer like  “Predicting the stock market with Datameer or “Fishing the Clickstream…” and my mind started racing.  What other insights can companies gain?  What are companies doing if they aren’t already tapping into big data?  It seems that the way companies look at customer and prospect information and interaction has changed forever.  Without giving away any secrets, I’d love to hear about what insights companies are now capturing in their big data analytics….

 

Posted in Big Data Analytics Perspectives | Tagged , , , , | Leave a comment | RSS

Why I am at Datameer

 

Post from Brian Smith, Regional Director of Sales

I ran the commercial sales division at Vertica Systems for several years prior to joining Datameer. At the risk of going on (and on), I wanted to share my enthusiasm around my professional experience selling big data solutions, and why I am so excited to be at Datameer.

Circa 2011 – We’re all producers and consumers of data in almost every aspect of our professional and personal lives. Analysts anticipate a compounded 40% per year growth rate in corporate data volume, with the lion’s share of the growth in unstructured data. In 2011, the name of the game in big data is Hadoop, and it’s become the Gold Rush of interest. Why?

The economics are compelling – Hadoop is moving out costly analytic databases and warehouses, driving IT to re-look at ADBMS sales cycles, shifting IT dollars and vendor roadmaps, and generally wreaking havoc in the traditional vendor community. We’ve gone from one or two distributions to nine in the last year! And, literally every vendor in the BI/DBMS space has a Hadoop connector, the latest being the recent Oracle announcement. Everybody is on board this train – All this based upon the premise of unlimited scale and data variety at a fraction of traditional costs.  Technical challenges exist, but its clear that there’s a sea change.

Prevailing winds – In light of large corporate BI and data warehouse investments, companies are typically using Hadoop as a staging and storage area to start… This involves parallel loading high volumes of raw enterprise data into Hadoop for subsequent scrubbing, (ETL), in route to passing smaller subsets for analysis to the existing BI stack through connectors.

There are several reasons why this approach will be short-lived:

  • Business analysts in this process are by definition separated from the data that they need to do their job – The end product that they receive is a limited subset of available data for analysis, simply because traditional BI cannot handle the volumes of data.
  • Time value of data – the development/IT resource required to ETL the data and move/process the subsets introduces unacceptable delays. People have jobs to do and analyses to rely on…
  • Cost – at each stage of this process, costs are created and duplicated across developer, vendor license, support and training , only to yield a partial answer based on a fraction of the data.  It’s incredibly inefficient and complex.

Conclusion?  Where such obvious inefficiency exists in a business segment where literally billions of dollars are spent every year to achieve business efficiency (!), alternative solutions will rapidly emerge…That’s why I am at Datameer.  (“Data Ocean” in German)

“Datameer is the first BI/Analytics platform built natively on Hadoop.”

On the surface it sounds interesting, but in practice the solution is game-changing.  The Datameer Analytic Solution (DAS) connects business users directly with the entire volume and variety of their raw Hadoop data and makes it available for comprehensive analysis.

DAS does this in a way that any business user will find to be simple, efficient, and completely intuitive. IT will love it as well, as Datameer frees up the IT folk to focus on more value add work rather than writing code to get data into Hadoop.

Practice makes perfect…Datameer gives business users an iterative “prototyping” capability for the data pipeline against a small sample of the data prior to running production analytics against the Hadoop cluster. (IT guy smiling…) It’s simple, practical, and directly in line with how the business needs to access, analyze and consume data – all without stepping on IT’s toes.

Under the covers, DAS generates Java/MapReduce code that runs natively on the Hadoop cluster. All current Hadoop distros are supported – we’re Switzerland when it comes to platform support for Apache, Cloudera, MapR, IBM and the rest, we run all of it in a browser on Windows, Mac and Linux.

DAS is an open book at every stage of the data pipeline, with plug and play support at each phase – integration, analysis and visualization. So you can pick and choose,  plug in your own custom analytic functions, use your visualization tool of choice, or simply use DAS end-to-end as an integrated stack.  What’s not to like?

I attended an Hadoop user meeting in San Francisco a few weeks back when I first joined Datameer. One of the moderators in the small group discussion was a local IT/Development manager responsible for supporting a 200 node Hadoop cluster. He made a very telling comment:

“The other day, one of my business guys asked me why I couldn’t just ship him a half a petabyte of .csv files?”

Game on!

Posted in Uncategorized | Tagged , , , , , | 1 Comment | RSS

Whose Hadoop is Bigger? Really…

 

Guest post from CEO Stefan.

Actually, Datameer contributed the most code to the Hadoop ecosystem.

After weeks of hard work we made a surprising discovery, actually 99% of the Hadoop ecosystem code was written by Datameer engineers.

We hired an external, independent, respected analyst firm that had 5 PhD’s working on a new generation of algorithms that analyze the Hadoop ecosystem source code, jira posts, emails and ideas contributed in verbal conversations.

The breakthrough was that we analyzed the object inheritance and the call stack to weight the importance of each line of code. We also took the mental stability of contributors into account. BTW, if you still wondering what was in my coffee this morning, you don’t get my German sense of sarcasm.

Well….

When I joined the Nutch project in the early 2000’s, I was known to communicate my strong points of views very loudly in the community. I guess I lost some steam over the years, I have not even published a blog post in last few years and the Hadoop & Co mailing lists are on read only subscription.

But I felt I had to speak up about all this commotion around “my Hadoop is bigger than yours” currently lighting up the community.

I tried to take some wind out of this conversation over the last few months by using our product to analyze the Hadoop source code and present, in a very fun way, some Hadoop source code insights here. These analytics discovered the longest email conversation for the smallest code change or longest commit comment for the shortest change, etc etc.

So now we find our partners and friends sparring over whose contribution is bigger than the others. Frankly, this is all surprising to me since we have so much more work to do to move Hadoop forward. Don’t get me wrong, we love Hadoop for what it is but we all can agree that the code is still a work in progress, monolithic, difficult to test and concepts like inversion of control do not exist… I could go on for a while.

So actually I’m happy to announce that our own awesome engineering team is not responsible for this but instead focused on working on a great analytics product on Hadoop that brings great value to our customers.

Here at Datameer we work hard but also make sure we have a good time including sharing a laugh over the most stressful situations.

In that spirit we would love to contribute a laugh to the ongoing “civil war” in the Hadoop ecosystem.  To commemorate this epic discussion, we have designed a special t-shirt that we would love to share free with the community.

Just fill out the form and we send you your own free shirt (one shirt per household, while supplies last).

Send me one of these cool “My Hadoop is Bigger Than Yours” t-shirts.

Ok, people, now back to work – lets build some great technology instead of arguing about lines of code.

P.S. We have some customers using DAS, their Hadoop is for sure bigger than yours. :)

Stefan Groschupf

Posted in Uncategorized | Tagged , , , , , | 3 Comments | RSS

Predicting the stock market with Datameer

I recently read an interesting research paper by Johan Bollen, Huina Mao, Xiao-Jun Zeng, from Indiana University entitled “Twitter mood predicts the stock market,” that investigated whether “collective mood states derived from large-scale Twitter feeds” correlated with the value of the Dow Jones Industrial Average. What they found was that their algorithm not only paralleled market changes, it predicted them, with startling 87.6 percent accuracy!

As a provider of Big Data analytics software, we see this type and scale of problem all the time at our customer sites, particularly the correlation of structured and unstructured data.  For this particular study, let’s see how easy it is to reproduce this analysis with Datameer Analytics Solution (DAS).

First, let’s download the Dow Jones stock values data. You can get this freely, from Yahoo for example (DJIA). This is a simple CSV file format showing daily prices. You can also download other data, such as the NYSE Composite index, to experiment with.

Second, let’s get some Twitter data from their API, known as the “fire hose”.  For this test, we’ll use raw data (i.e. unfiltered tweets) for the entire month of March 2010.

Let’s load all of this data into DAS.  In our new 1.3.x version, you can simply upload a file from your local computer, so let’s load our Dow Jones data this way:

Upload file

Then let’s load the tweets, via an Import Job, which understands Twitter’s format natively:

This amounts to about 30 GBs of compressed data for the month.

Let’s first try and get a more accurate data set, by filtering the tweets to US users. This is something that our researchers apparently did not do: “we note that our analysis is not designed to be limited to any particular geographical location”, but this is easy to do with DAS.

We did not have OpinionFinder nor Google-Profile of Mood States at our disposal to perform sentiment analysis (these could make great new functions some day that could be added via our API!), so let’s use instead a simplified version by taking a list of positive terms (Bag of words model), and find the tweets that contain these terms.

To do this in DAS, let’s import a list of such terms (this can be easily found on different web sites), and create an outer join with our tweets, and then filter to find the tweets that contain these positive words.

In DAS 1.3.x you can filter with a complex expression directly in the ‘Advanced’ tab:

Now let’s count the positive tweets per day. This is just an aggregation sheet using GROUPBY and counting (this is the sheet preview result below, not the actual count on the full data set yet):

This represents the amount of “happiness” mood by day.

Next, let’s create a new workbook to join the resulting worksheet of “happiness” mood per day with our Dow Jones Industrial Average (ticker ^DJIA) data:

A very helpful feature in DAS is the fact that we can seamlessly exchange the DJIA history with, say, NYSE Composite index history via the ‘Exchange Datasource’ button and rerun the workbook to test the correlation with data from other exchanges. This requires no further changes or additional work (more details on this later).

Here is the resulting sheet of our join:

As you may know, building analyses in DAS works on a sample of the entire data set, which enables users to easily interact with the data until they’re satisfied with the analysis.   Now that we are happy with our analysis, let’s run the workbook on the entire data set.

Now let’s graph the tweet “happiness” mood and the DJIA market closing value over the same days and compare:

As the researchers pointed out, we can note a correlation between our Twitter “happiness” index, and how the Dow Jones Industrial Average went up or down between two and six days later; first see the progressive parallel mood upswing (1), then the drop on (2) (drop in Twitter mood followed by drop in the DJIA value), an upswing again on March 19 at (3) – Twitter mood goes up quickly followed by the DJIA value -, then a parallel drop on March 22 followed by the same drop in DJIA value a few days after  (see (4)). A similar correlation can be found by using NYSE data instead.

Disclaimer note: this analysis was done on only a month’s worth of data, but could be expanded to more data very easily with no further changes in the analysis. We also used a very simple technique of sentiment tracking, which could be further improved. Finally, due to the small amount of data, we did not have outliers in the data like our researchers did (“significant socio-cultural events such as the Presidential election and Thanksgiving, short-lived uptick in positive sentiment specific to those days”), but we could easily filter had we worked with more data.

Pretty easy, wasn’t it?  That simplicity: combining all kinds of data ad hoc, while harnessing the power and scalability of Hadoop to extract insights, is what Datameer is all about.  I hope you’ve enjoyed this post, and you can learn more about Datameer at www.datameer.com/products.

Posted in How-to, Uncategorized | Leave a comment | RSS

Fishing the Clickstream…

 

Firstly, I’m excited to announce that there’s a major new release of DAS (1.3) available.  1.3 includes, among other things, some powerful tools to perform clickstream analysis through just a few simple steps, and makes visualization of user behavior a breeze.  I wanted to give you a overview of these new tools, and provide some food for thought on how simple it is to extract meaningful insights into visitor behavior from raw web logs, a common use case for DAS and Hadoop.

The goal here is to be able to scrape raw log files from your Apache or IIS web servers and visualize something like this:

This new visualization in DAS, called the “Circular Connection Graph” tells us the relative density of one-hop clickpaths.  It’s an easy way to measure and visualize click-through rate (CTR) from various campaign landing pages, or to compare the popularity of referring web sites (i.e. marketing partners who drive traffic to your site). But this is just one small fish in the sea of weblogs (see what our customers say about the importance of behavioral analytics).

The real magic for Hadoop and DAS is that this data, when enriched with visitors profile or other interaction data (think: MySQL, Oracle, Teradata, Twitter), can give you fine-grained, visitor-level insights previously out of reach.  Canned web traffic reports from a traditional application might only give you aggregated data; cloud-based analytics solutions might show you detail in the clickstream, but can’t correlate that behavior with the transaction systems of record that track the rest of the customer lifecycle, namely: purchases, balance history, call center interactions or in-store visits.  There’s more about that here.

Let me show you a bit about what I mean.  With standard web analytics packages, you can easily get answers to the basic questions of web behavior (including popular pages, session duration and clicks per session) with canned reports.  These are straightforward aggregations (roll-ups) which are easily done in DAS, and much easier than in raw Hadoop, where you’d write Hive QL, Pig or MapReduce code.

Here’s a few examples of those (click the images if you’d like a larger view).


Thanks to the game-changing economics of Hadoop, you can always afford to save every click.  What does that mean?

1. Raw server logs can be fed into Hadoop, eliminating a separate ETL, modeling or pre-processing stage in the data pipeline.  With DAS, this requires zero coding.

2. Using DAS, key elements of user behavior; not just session stats, but page dwell time and click paths preferred by specific users, can easily be extracted and sliced on any dimension.  That provides insightful stats like what you see below. It could also mean dense visualizations like the one at the top of this post, which can serve up daily insights to the folks responsible for customer acquisition or marketeers managing campaigns.


DAS also gives you flexibility.   First, it separates the wheat from the chaff.  Filtering errors, image requests and page refreshes from the clickstream is simple.  Second, DAS let’s you divide-and-conquer the data pipeline.  Data warehousing expertise can be applied to cleanse, enrich and pre-process the data (e.g. sessionizing traffic your own way, with any timeout), which can then be fed on a platter to the BI and marketing teams to create roll-ups, or to data scientists to look for clusters of visitors or develop predictive models. Finally, you can go wild and join this with anything you can throw at DAS: user profile, demographics, emails from your CRM, Twitter feeds, or last month’s blog post.  Sound like a fantasy?  All you need is a handful of spreadsheets and an imagination.  Click to zoom in on the screenshot below to get a taste.  Or wait for the video I’ll be posting soon.

This is clearly a rudimentary example of clickstream analytics, but it’s a starting point that contains valuable nuggets of insight, and it’s easy to extend.  Most importantly, it makes this machine-generated data accessible.  And that’s what data science is all about.

Want to get started today? Contact us for a free trial download, VMWare, or turnkey instance in the cloud.

Happy fishing!

Posted in Announcements, How-to, Uncategorized | Tagged , , | Leave a comment | RSS

It’s About Time…

 

In this post, I’ll show you a thing or two about the powerful capabilities of DAS to perform time series analytics.

Most analysts swimming in today’s sea of unstructured data are poorly served, receiving only daily or weekly canned reports which provide a course, aggregated view of what’s happening within their data.  These reports lack the flexibility and granularity necessary to investigate data sets spanning multiple sources, combe structured and unstructured data, and examine them at different levels of detail.

Empowering users with a self-service approach, DAS allows you to be a real data detective.  DAS makes it easy to slice and dice time-sensitive information such as clickstream data, twitter feeds, game events or even emails and discover hidden trends, whether they be long-term or short-lived.  Let’s take a look into the details of how DAS can take the raw materials and uncover something useful in just a few minutes.

Here’s what we’ll cover:

  • DAS and dates
  • Time series reports by day, hour, and beyond
  • Cleaning up dirty time/date information
  • Fine-grained analysis, down to the minute
  • Visualizing trends in DAS dashboards

Make a date with DAS

Firstly, DAS deals with date and time information natively.  Whatever the source, DAS takes your date/time info and gives you a bona fide date which can be manipulated using a calendar.  This is useful for filtering, allowing you to create different windows in different worksheets.  Here’s a screenshot of that.

Slicing and dicing

DAS provides a number of functions which extract bits and pieces of the date, so you can flexibly assembly slices and summarize them (daily click volume, average size of purchases at 4 am, etc).  The functions MONTH(), YEAR(). DAY() and HOUR() are simple ways to grab date/time elements and then group the data just the way you like it.  You can also create complex slices based on multiple pieces, such as an hourly report over multiple days, or one that normalizes time info by time zone (if that’s in your data).  A picture is worth a thousand words, but there’s also a full list of date functions here.

To get at some particulars of the date (like time zone), it’s necessary to use FORMATDATE(), and tell DAS the specific pattern you’re looking for.   There’s an example of that here, and a list of what you can do with that here.

Right about now

Sometimes you need an immediate understanding of what’s happening with your data (and perhaps take immediate action).  DAS lets you react based on the time your report is run.  The functions NOW() and TODAY(), and the ability to use offsets (e.g. +7 days, -6 hours) allow you to determine the freshness of the information you’re analyzing, or the proximity of events, and automate your response. For example, if I want to analyze only web traffic from the last twelve hours, I would do something like this.

When a date is not a date

Date/time information isn’t always well-prepared.  Quite often, dates are embedded within other data, as parts of a URLs, JSON objects, or even large sections of unstructured text.  Fortunately, DAS lets you construct dates out of any raw text, regardless of format, using ASDATE().  Here’s an example.

Up to the minute

DAS will let you drill into tiny time windows, of any size, to identify irregularities in data that appear for only a few minutes or even seconds.  This can be necessary when you’re monitoring financial markets or feeds from social media sites, or just trying to understand system behavior (like erratic usage patterns or fluctuations in web traffic due to downtime).  While there are a number of ways to do this, the simplest is to put your data into bins (buckets) of the desired size.  To do this, DAS provides GROUPBYBIN(). But first, the date must be converted to a numeric value with TIMESTAMP().  As an example, I can group data into five minute slices by writing GROUPBYBIN(TIMESTAMP(#Date);360000).  The big number is the size of the bin in milliseconds.  Here’s a screenshot of that.  If you’re still scratching your head with this one, you might want to watch the video.  It’s toward the end.

The big picture

A picture is worth a thousand words. When you’ve got billions of events, that’s a lot to talk about.  Assembling dashboards to visualize time series analytics with DAS is simple and straightforward, and the results can be enlightening.  For starters, let’s take a look at a chart that compares the volume of tweets about two trending topics over the course of a day.

Time Chart

Looks pretty, doesn’t it?  Now let’s have a look at a similar chart which examines a smaller time range, but with a finer toothed comb; in one minute slices.  See the spike?

That’s something we’d never have seen in the kind of weekly reports you see below.  Yet, those more course-grained reports are still useful in providing an overview to executives.  Why is that important? Well, it depends on your use case, but a number of Datameer’s customers are interested in identifying temporary irregularities or patterns that might represent value opportunities or even critical problems they can’t wait to address.

Watch the video:  If you’d like to get a little more hands-on, I’ve posted a live demonstration illustrating all these concepts here.

Posted in How-to, Uncategorized | Tagged , , , , | Leave a comment | RSS