Showing posts with label SEO. Show all posts
Showing posts with label SEO. Show all posts

Tuesday, September 3, 2013

Big Data Analytics solutions for Online Marketing - Use Case 1

A sample Online Marketing application deployed in the Big Data Architecture, is shown below.



Online users search for products, services, topics of interest etc. not only in Google and other search engines, but also more importantly on site itself (For example, in eCommerce site Amazon.com, search is the top product finding method used by site visitors). Facilitating searchers by providing relevant search results is something online search providers like Google, Bing and also site search providers continuously optimize and calibrate.

From an Online Marketing perspective, once the searchers click through the search results and arrive at the website (if coming through external search like Google) or arrive at the product or topic page they were searching internally on the site, that page of arrival from a search result, called as landing page in Online Marketing terminology, is very important for:
  • Improving Conversion Rate (%) of the site.
  • Traffic dispersion to subsequent stages of the site.
  • Improving site engagement for the users 

As already discussed in a previous post, delivering dynamic and search relevant landing pages is very important, particularly for large websites like eCommerce stores, Music & Movie download sites, Travel websites etc.  While delivering keyword or search relevant landing pages dynamically across thousands of keywords, perhaps across hundreds of thousands of keywords for large websites, itself is a big challenge; even bigger challenge is to deliver these dynamic, search relevant landing pages targeted to each of different user segments. As already discussed previously, luckily Big Data Analytics solutions are available now to solve these Big Data challenges in Online Marketing.

Large websites generate and also need to process, huge volumes of different varieties of data as below:

  • Website clickstream data collected through Web Analytics applications like Omniture and from webserver logs.
  • The website content such as product content, marketing content, navigation etc. in various formats like text, images, videos etc. which is available in the web content management systems.
  • External web content typically collected by web crawlers, which includes content such as
    • Product content from competitor websites
    • Marketing collaterals from external industry websites etc.
  • User generated content such as product reviews, user survey feedback, social media posts, online discussions, tweets, blog posts, online comments, Wiki articles etc.

Most of the above varieties of data are unstructured or semi-structured, and hence cannot be collected and processed in traditional RDBMS databases like Oracle or MySQL.

For large websites, it is not just important to collect large volumes of variety of data as shown above, but it is also important to handle the velocity at which all these data is getting generated online, particularly clickstream data and user generated content.

This is where Big Data Analytics solutions come in. In this above example, a typical Architecture to support Big Data Analytics is solutioned using open source Apache Hadoop framework.  In an Hadoop architecture - big volumes, variety and velocity of online data are collected and then stored in HDFS file system. Hadoop architecture also provides RDBMS like databases such as HBase, for storing big data in traditional style, particularly useful for beginners and new users of these Big Data Architectures. As we can see in this example, a big data landing zone is set up on a Hadoop cluster to collect big data, which is then stored in HDFS file system.

Using Map-Reduce programming method, Online Marketing Analysts or Big Data Scientists or Analysts develop and deploy various algorithms on a Hadoop cluster for performing Big Data Analytics. These algorithms can be implemented in standard Core Java programming language which is the core programming language used for executing various services for collecting, storing and analyses of big data in a Hadoop architecture.  Additional programming languages like Pig, Hive, Python or R can be used to implement the same algorithms with less number of lines of code to be deployed. However code written in any of these additional languages would still be compiled into Core Java code by Java Compilers for execution on Big Data Hadoop Architectures.

Some of the use cases of Online Marketing Algorithms which can be implemented on Hadoop Architecture for deriving Analytics are shown in the same example. All these algorithms are deployed using the Map-Reduce programming method.

  • Keyword Research: Counting the number of occurrences in content and search for hundreds of thousands of keywords across the diverse variety of data collected into Hadoop and stored in HDFS. This algorithm would help identify top keywords by volume, and also the long tail of hundreds of thousands of keywords searched by users. Even new hidden gems among keywords can be discovered using this algorithm to deploy in SEM/SEO campaigns.
  • Content Classifications / Themes: Classify the user generate content and also web content into specific themes. Due to huge processing capabilities of Hadoop Architecture, huge volumes of content can be processed and classified into dozens of major themes and hundreds of sub themes.
  • User Segmentation: Individual user behavior available in web clickstream data is combined with online user generated content and further combined with user targeted content available in web content management systems to generate dozens of user segments, both major & minor segments. Further this algorithm would identify the top keywords and right content themes targeted for each of the dozens of user segments, by combining the output from other algorithms used for Keyword Research and Content Classifications.

Also, since the Hadoop Architecture is running on clusters of computers, all the above algorithms can not only process huge voluminous amounts and varieties of data, but can handle data in motion which keeps coming into the Hadoop Big Data landing zone in near real time. This would enable the Online Marketing Campaigns to be tweaked in near real time to derive better ROIs from Online Marketing spends.  In the example illustrated above, the output from the 3 algorithms running in parallel, is dynamic Keyword Relevant Content Rich User Targeted Landing Pages generated in near real time, for hundreds of thousands of keywords, across dozens of content themes and targeted across dozens of user segments. This output would be integrated with eCommerce platforms or Web Content Management Systems or with Web Portals for creation, production & delivery of Keyword Relevant Content Rich User Targeted Landing Pages in near real time.


Signature: Roopkumar T.V.

Saturday, August 10, 2013

Dynamic and Search Keyword Relevant Landing Pages

One of the biggest challenges faced by Large Dynamic websites like eCommerce Stores, Music or Movie download websites, Travel & Hospitality websites etc. is that they don't always deliver the most relevant landing pages to the Visitors who arrived at their website from Google and other search engines.

Recalling a much simpler earlier post on this topic, the disadvantages of not delivering the most relevant landing pages to users who arrived from Google or other search engines would include
  • Lost sales or lead generation opportunity
  • Lost opportunity to build engaging long term customer relationships or customer loyalty
  • Bad reputation and negative feedback, even negative reviews
  • Lost investments on the website
A landing page is the first touch point on the website for users coming from search engines. Users will not spend more then 10 seconds on a website, which has a irrelevant landing page. Capturing the users interest in the first 10 seconds is very important, and this is only possible by delivering the most relevant content to Visitors consistently.

The large dynamically changing websites would be searched and found in search engines like Google, across thousands of keywords. The top searched keywords would keep changing for large dynamic websites each month, or perhaps even each week. Also there would be a long tail of thousands of keywords, in some cases hundreds of thousands of long tail keywords for large websites. Hence delivering Search Keyword Relevant Landing pages across thousands, perhaps hundreds of thousands of keywords is always a challenge for large dynamic websites.

Good news is that solutions are now available to help large dynamic websites, to always deliver Search Keyword Relevant Landing pages across hundreds of thousands of keywords. All these solutions leverage the Big Data Analytics Platforms. 

Big Data Analytics Solutions would help organizations to always deliver Dynamic and Search Keyword Relevant Landing Pages across hundreds of thousands of keywords always and for every search.

Big Data Analytics Platforms benefit Organizations in discovering potential keywords for SEM and SEO by 
  1. Scanning or crawling all their content found anywhere on the internet, discovering hundreds of thousands of potential keywords for which their content (websites, images, videos, mobile apps, social apps, pages, blogs, Facebook fan pages etc.) could be discovered in Google search. 
  2. Scanning or crawling the content of direct competitors to discover additional potential thousands of keywords.
  3. Scanning similar content across other websites, blogs, social media, mobile or social apps, images, videos etc. to discover the long tail of potentially hundreds of thousands of keywords.


By integrating these Big Data Analytics Platforms to their Web Portals, E-Commerce Platforms, Content Management Systems, Business Process Management systems etc. - Organizations can deliver Dynamic and Search Keyword Relevant Landing Pages across hundreds of thousands of keywords always and for every search. This methodology would be discussed in more detail in future posts.

Signature: Roopkumar T.V.

Monday, August 5, 2013

Online Multi-Channel Campaign Attribution

While I prefer to write my own posts, based on my own experiences - This post is credited to the official Google Analytics website.  This post and the embedded video from Google Analytics team provides an overview of online multi-channel campaign attribution in most simple and effective way then anywhere on the web.

Original link to Google Analytics site on online Multi-Channel Campaign Attribution.




Signature: Roopkumar T.V.

Wednesday, July 31, 2013

Relevant Landing Pages for Search keywords

I will start my first post with a simple and effective method to improve the user experience for searchers who come to your website from Paid search campaigns using search engines like Google. Give the users the most relevant landing page.

If the user searched using a generic keyword or brand keyword, it would mean the user is still in awareness stage of purchase life-cycle. Send these users to home page.

If the user searched using specific keywords, related to specific categories in the content or product hierarchy of the site, direct them to relevant category pages. Users are mostly at a selection stage of purchase life-cycle.

If the user searched using very specific product names or for very specific content, it would mean the user has reached the buy stage of purchase life-cycle. Direct these users to most specific inner landing pages in the site serving the very specific content or products searched by these users.

Following the above points, would ensure that the first step in the digital user experience of a user coming from search engines is taken care to


  1. Reduce bounce rates of landing pages
  2. Increase next page traffic dispersion to subsequent pages from the landing pages.

Signature: Roopkumar T.V.