Logo Website Test




Generated on March 24, 2016 at 16:00


* The results are cached for 30s. If you have made changes to your page, please wait for 30s before re-running the test.

SEO Content

Title Web Data Extraction Using Artificial Intelligence - Diffbot

Length : 59

Perfect, your title contains between 10 and 70 characters.

Description

Length : 0

Description is missing. It will tell search engines and potential visitors what your page is about. Keep it between 70 and 160 characters (spaces included).

Note: Google sometimes chooses not to display the description you provide here and instead shows a part of your page text that seems relevant to the user's search query.
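
The title and description checks above can be sketched as a simple length test. This is a minimal illustration, not the tool's actual logic: the function name `check_length` is hypothetical, and treating the stated ranges (10 to 70 and 70 to 160 characters) as inclusive is an assumption.

```python
def check_length(text: str, low: int, high: int) -> str:
    """Classify text length against an inclusive [low, high] range (assumed inclusive)."""
    n = len(text)
    if n == 0:
        return "missing"
    if n < low:
        return "too short"
    if n > high:
        return "too long"
    return "ok"


title = "Web Data Extraction Using Artificial Intelligence - Diffbot"
description = ""  # the report found no meta description

print(len(title), check_length(title, 10, 70))  # 59 ok
print(check_length(description, 70, 160))       # missing
```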
Og Meta Properties This page does not use Open Graph (Og) properties. When people share or like your page, Og properties control the title, description and image that appear with the link. Without them, people will just see a plain link to your website.

In case of Facebook or Twitter this is how you get started:

Facebook: https://developers.facebook.com/docs/sharing/webmasters

Twitter: https://dev.twitter.com/cards/getting-started
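
As a rough sketch of what adding Og properties involves, the snippet below renders the four basic Open Graph tags (`og:title`, `og:type`, `og:url`, `og:image`) as HTML. The helper name `og_meta_tags` and the image URL are hypothetical, and the page URL is assumed from the domain in this report.

```python
from html import escape


def og_meta_tags(props: dict) -> str:
    """Render a dict of Open Graph properties as <meta> tags, one per line."""
    return "\n".join(
        f'<meta property="og:{escape(name, quote=True)}" '
        f'content="{escape(value, quote=True)}" />'
        for name, value in props.items()
    )


tags = og_meta_tags({
    "title": "Web Data Extraction Using Artificial Intelligence - Diffbot",
    "type": "website",
    "url": "http://diffbot.com/",                      # assumed canonical URL
    "image": "http://diffbot.com/img/share-card.png",  # hypothetical image path
})
print(tags)
```

The resulting lines would go inside the page's `<head>` element.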
H1 H2 H3 H4 H5 H6
3 13 11 0 3 0
  • [H1] Nothing Personal.
  • [H1] Meet structured web data.
  • [H1] As seen in...
  • [H2] Artificial intelligence
  • [H2] for companies that revolutionize online shopping
  • [H2] Artificial intelligence
  • [H2] for companies that are mobile-first
  • [H2] Artificial intelligence
  • [H2] for companies that invented Internet networking
  • [H2] Artificial intelligence
  • [H2] for web data you can bank on
  • [H2] It's just that Diffbot's computer brain can do some things better.
  • [H2] Diffbot integrates in minutes and starts returning data in seconds.
  • [H2] Diffbot in the press.
  • [H2] Get Started Now. Try Diffbot free for 14 days!
  • [H2] Test Drive the Article Analyze API Article API Discussion API Image API Product API Video API
  • [H3] The largest retailers in the world use Diffbot to retrieve online shopping data for competitive analysis, business intelligence and to build new consumer experiences.
  • [H3] Startups have revolutionized mobile experiences by delivering only the content that users want and removing all the junk. Instapaper uses Diffbot to pull out article text, pictures, headlines and authors so a user can just click "read later" and enjoy.
  • [H3] One of Silicon Valley's greatest success stories built its business by listening closely and responding to feedback from its users. They now use Diffbot to automatically monitor comments and forum posts to better understand their customers and react in real-time.
  • [H3] After five years, thousands of customers and billions of web pages, we're only just getting started. Diffbot is proud to announce new funding to support accelerated efforts in structuring the web's information.
  • [H3] Accurate
  • [H3] Comprehensive
  • [H3] Cost-effective
  • [H3] Crawl entire sites automatically
  • [H3] Extract anything, anywhere
  • [H3] Automatic page classification
  • [H3] Individual web page extraction
  • [H5] Product
  • [H5] Resources
  • [H5] Company
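
A heading count like the one above can be reproduced with Python's standard `html.parser`. This is only a sketch of the idea; the class and function names are hypothetical and the sample markup is made up for illustration.

```python
from collections import Counter
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingCounter(HTMLParser):
    """Counts opening h1..h6 tags as the document is parsed."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this method.
        if tag in HEADING_TAGS:
            self.counts[tag] += 1


def count_headings(html: str) -> dict:
    parser = HeadingCounter()
    parser.feed(html)
    return {f"h{i}": parser.counts[f"h{i}"] for i in range(1, 7)}


sample = "<h1>A</h1><h2>B</h2><h2>C</h2><h5>D</h5>"
print(count_headings(sample))
# {'h1': 1, 'h2': 2, 'h3': 0, 'h4': 0, 'h5': 1, 'h6': 0}
```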
Images We found 80 images on this web page.

80 alt attributes are empty or missing. Add alternative text so that search engines can better understand the content of your images.
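
To find the images that need alternative text, a check along these lines could be run against the page source. It is a sketch using the standard `html.parser`; the names `AltAudit` and `audit_alt` are hypothetical, and counting whitespace-only `alt` values as empty is an assumption.

```python
from html.parser import HTMLParser


class AltAudit(HTMLParser):
    """Counts <img> tags and those whose alt attribute is empty or missing."""

    def __init__(self):
        super().__init__()
        self.total = 0
        self.missing = 0

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        self.total += 1
        alt = dict(attrs).get("alt")
        # A missing attribute, a bare `alt`, or blank text all count as missing.
        if alt is None or not alt.strip():
            self.missing += 1


def audit_alt(html: str) -> tuple:
    parser = AltAudit()
    parser.feed(html)
    return parser.total, parser.missing


sample = '<img src="a.png" alt="Logo"><img src="b.png" alt=""><img src="c.png">'
print(audit_alt(sample))  # (3, 2)
```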
Text/HTML Ratio Ratio : 14%

This page's ratio of text to HTML code is below 15 percent, which suggests that your website probably needs more text content. Search engines these days are smart enough to figure out whether a page is on-topic and interesting enough to rank higher in their search results.

Try to write a good article of around 1,500 words, and check it for grammar and spelling errors. As they say in the SEO world: "Content is King."
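
One way to estimate a text-to-HTML ratio is to divide the length of the visible text by the length of the full HTML source. The exact formula this tool uses is not stated, so the sketch below is an assumption; it skips `<script>` and `<style>` content and collapses whitespace before measuring.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes, ignoring anything inside <script> or <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def text_html_ratio(html: str) -> float:
    """Return visible text length as a percentage of total HTML length."""
    if not html:
        return 0.0
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(" ".join(parser.parts).split())  # collapse whitespace
    return 100.0 * len(text) / len(html)


sample = "<html><body><p>Hello world</p><script>var x = 1;</script></body></html>"
print(f"Ratio : {text_html_ratio(sample):.0f}%")
```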
Flash Perfect, no Flash content has been detected on this page.
Iframe Great, there are no IFrames detected on this page.

URL Rewrite Good!
Underscores in the URLs Perfect! No underscores detected in your URLs.
In-page links We found a total of 23 links including 0 link(s) to files

Anchor Type Juice
- Internal Passing Juice
Products Internal Passing Juice
Pricing Internal Passing Juice
Why Diffbot? Internal Passing Juice
Company Internal Passing Juice
Login Internal Passing Juice
14 Day Free Trial Internal Passing Juice
See the News Internal Passing Juice
Join the Team Internal Passing Juice
See how Crawlbot can help » Internal Passing Juice
Find out more about Custom APIs » Internal Passing Juice
Automatic APIs Internal Passing Juice
Custom API Internal Passing Juice
Crawlbot Internal Passing Juice
Plans & Pricing Internal Passing Juice
Documentation Internal Passing Juice
Support Internal Passing Juice
Libraries Internal Passing Juice
API Status Internal Passing Juice
About Internal Passing Juice
Contact Internal Passing Juice
Terms Internal Passing Juice
Privacy Policy Internal Passing Juice

SEO Keywords

Keywords Cloud article artificial diffbot test companies drive data api web image
Keywords Consistency
Keyword Content Title Description Headings
api 7
diffbot 6
image 5
web 5
companies 4


Url Domain : diffbot.com
Length : 11
Favicon Great, your website has a favicon.
Printability We could not find a Print-Friendly CSS.
Language Good. Your declared language is en.


Doctype HTML 5
Encoding Perfect. Your declared charset is UTF-8.
W3C Validity Errors : 101
Warnings : 4
Email Privacy Warning! At least one email address has been found in plain text. Spam bots can harvest plain-text addresses and use them for spam. Please consider one of these options:

1. Integrate a contact form on your site.
2. Make it difficult for spam bots by using an obfuscator: Email Address Obfuscator
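
A very simple form of obfuscation replaces each character of the address with its numeric HTML entity, so browsers still display it but the raw source no longer contains the literal address. The sketch below assumes this technique; the function name and the `example.com` address are placeholders. Entity encoding is likely to deter only naive scrapers, so a contact form remains the stronger option.

```python
import html


def obfuscate_email(address: str) -> str:
    """Encode every character as a decimal HTML entity, e.g. '@' -> '&#64;'."""
    return "".join(f"&#{ord(ch)};" for ch in address)


address = "info@example.com"  # placeholder address
encoded = obfuscate_email(address)
print(encoded)

# A browser (or html.unescape) decodes the entities back to the original text.
decoded = html.unescape(encoded)
```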
Deprecated HTML Great! We haven't found any deprecated HTML tags.
Speed Tips
Excellent, your website doesn't use nested tables.
Your website is using inline styles.
Great, your website has a few CSS files.
Your website has too many JS files (more than 6).
Your website does not take advantage of gzip.
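
To get a feel for what gzip could save, the sketch below compresses a response body with Python's standard `gzip` module and reports the size reduction. The helper name and the sample markup are hypothetical; in practice compression is enabled in the web server configuration rather than in application code.

```python
import gzip


def gzip_savings(body: bytes) -> float:
    """Return the percentage size reduction achieved by gzip on `body`."""
    if not body:
        return 0.0
    compressed = gzip.compress(body)
    return 100.0 * (1 - len(compressed) / len(body))


# Repetitive markup, typical of HTML, compresses very well.
body = b"<div class='item'>row</div>\n" * 200
print(f"{len(body)} bytes, {gzip_savings(body):.0f}% smaller with gzip")
```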


Mobile Optimization
Apple Icon
Meta Viewport Tag
Flash content


XML Sitemap Missing

Your website does not have an XML sitemap. To make it easier for search engines to find your pages, it is wise to generate one.

A sitemap lists URLs that are available for crawling and can include additional information like your site's latest updates, frequency of changes and importance of the URLs. This allows search engines to crawl the site more efficiently.

To learn how to set this up, see: http://www.sitemaps.org/protocol.html

If you have a lot of pages, you can generate a sitemap automatically (free for up to 500 pages) here: https://www.xml-sitemaps.com
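
For a small site, a sitemap can also be generated with a few lines of code. The sketch below follows the `urlset` / `url` / `loc` / `lastmod` structure described by the sitemaps.org protocol; the function name, the page list and the dates are hypothetical examples, not data from this report.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages) -> str:
    """Build sitemap XML from (url, lastmod) pairs; lastmod may be None."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in pages:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        if lastmod:
            ET.SubElement(url, "lastmod").text = lastmod
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


# Hypothetical page list for illustration only.
pages = [
    ("http://diffbot.com/", "2016-03-24"),
    ("http://diffbot.com/pricing/", None),
]
sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```

The output would typically be saved as `sitemap.xml` at the site root and referenced from robots.txt with a `Sitemap:` line.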
Robots.txt http://diffbot.com/robots.txt

Great, your website has a robots.txt file.
Analytics Missing

We didn't detect an analytics tool installed on this website.

Web analytics let you measure visitor activity on your website. You should have at least one analytics tool installed, but it can also be good to install a second one in order to cross-check the data. This can be Google Analytics or Clicky.com
