The Blog

How to access, cite, and defend web datasets in academic research

Posted on November 24, 2016 by

We’re used to getting questions about accessing structured web data. But recently, we’ve been fielding a different kind of question. Researchers and scientists have been asking about data citation conventions and how to defend research that cites web datasets during peer review. As you might expect, we published our answers in the new Guide to Citing Web

Posted in Big Data

Can Crawled Web Data Tell the Future?

Posted on November 14, 2016 by

Robert Tercek’s book Vaporized: Solid Strategies for Success in a Dematerialized World recently won getAbstract’s 2016 International Book of the Year award at the Frankfurt Book Fair. Based in Hollywood, Robert has spent his entire career creating interactive content and inspiring others to do the same. He was kind enough to share a few words

Posted in Big Data, Marketing

Web Data Visualization of The Hillary Clinton Top 100 Network Graph

Posted on October 20, 2016 by

The web data business can get pretty tricky, especially when your job is to extract the broadest possible dataset from the planet’s biggest database. Last week, Webhose CEO Ran Geva ran a fun experiment to visualize Hillary Clinton’s web network. More precisely, who are the top 100 people most frequently mentioned in news articles and blog

Posted in API, Big Data, Technology

Should you buy crawled web data or build your own solution?

Posted on October 10, 2016 by

In a technologically driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data crawling and structuring solution at scale:

Development & Maintenance

Development could mean coding a proprietary solution from scratch, or modifying an existing crawling

Posted in API, Big Data

Top 10 Big Data Stories Leading the Conversation

Posted on September 26, 2016 by

In the right hands, crawled web data can tell an amazing story. We were interested in the top 10 news stories, sorted by social shares on Facebook and LinkedIn. So we set up a simple news API request for stories published over the past 30 days that exactly match the term “big data”. Here

Posted in Big Data, Technology
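
For readers who want to try the ranking from the “big data” post on their own data, here is a minimal sketch of the same idea: keep stories from the past 30 days that contain the exact phrase, then sort by combined Facebook and LinkedIn shares. It works on a plain list of dictionaries rather than on any real API response. The function name `top_stories` and the field names (`title`, `text`, `published`, `facebook_shares`, `linkedin_shares`) are assumptions made for illustration, and treating “exact match” as a case-insensitive phrase search is also an assumption.

```python
from datetime import datetime, timedelta


def top_stories(stories, phrase, now, days=30, limit=10):
    """Return up to `limit` stories, most shared first.

    `stories` is a list of dicts with hypothetical keys:
    title, text, published (datetime), facebook_shares, linkedin_shares.
    """
    cutoff = now - timedelta(days=days)
    needle = phrase.lower()

    # Keep only recent stories whose text contains the exact phrase
    # (case-insensitive; an assumption about what "exact match" means).
    matches = [
        s for s in stories
        if s["published"] >= cutoff and needle in s["text"].lower()
    ]

    # Rank by combined social shares, highest first.
    matches.sort(
        key=lambda s: s["facebook_shares"] + s["linkedin_shares"],
        reverse=True,
    )
    return matches[:limit]
```

Calling `top_stories(stories, "big data", datetime.now())` returns at most ten matching stories. Passing `now` explicitly keeps the 30-day window reproducible, which matters if the results are going to be cited later.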