Vaughn's Summaries logo Vaughn's Summaries

Internet Summaries
Content Theft

My Content Was Scraped by
Health-heart.net in 2011
Health-Heart.net is An Internet Content Thief
by Vaughn Aubuchon
Updated February 24, 2011

DEINDEXED! January 26, 2011
FIXED! January 31, 2011 - 11:00 AM PDT - Details below
DEINDEXED! February 10, 2011 - Details below
FIXED! February 12, 2011 - 9:00 PM PDT - Details below

In the past, my website content has been scraped many times, by various Internet thieves. The scraped content has been displayed pages deep in the SERPS by Google, so I wasn't too concerned.

But this time, not only was my content scraped, but Google has given the scraper the NUMBER ONE POSITION, using my content. I have been losing THOUSANDS of impressions per day, because of this situation. So has AdSense, since the scraper removed it.

My ORIGINAL content location -
http://www.vaughns-1-pagers.com/medicine/blood-pressure.htm

The STOLEN content location -
http://www.health-heart.net/Normal-Blood-Pressure-Chart.htm (copy and paste)
As you can now see, the stolen content page has been removed, as of Feb. 1, 2011.

Discussion at Google Webmaster Central

Discussion at Brett's Webmaster World

DMCA.doc

Screenshots -
My Original Page (top)

My Page Scraped by Health-Heart.net (top)

My Original Page (bottom)

My Page Scraped by Health-Heart.net (bottom)

Whois Information on Health-heart.net

Ranking Factors Compared

Loss of Traffic Summary

Other Theft of My Blood Pressure Page Content

Disclaimer
.

-

80

30



top of page

Here Is My Original Page
Top of my page screen shot
canonical page - top
This is my original, canonical web page.






top of page

Here Is My Page - Scraped by health-heart.net
Top of scraped page screen shot
scraped page - top
This is the scraped content web page.

Note that they have removed my logo, and replaced it with their own logo.
They have also removed the link to my Spanish version of the page.
They also removed my URL, and replaced it with their logo.

.




top of page

Here Is My Original Page
Bottom of the page screen shot
canonical page - bottom
See the copyright information?




top of page

Here Is MY Content, Scraped by Health-heart.net
Bottom of the page screen shot


scraped page - bottom
Note that they have edited out my copyright information.
Brazen, documented, intentional theft, in your face.
I have archived all their stolen content files for future reference.




top of page

The SCRAPER
http://www.whois.net/whois/health-heart.net
It is time for me to send a DMCA to
health-heart.net
They have no contact page, about page, or privacy page.
(from whois) Email: whois@bluehost.com
Provider: Bluehost.com


top of page

The PROVIDER
http://www.whois.net/whois/bluehost.com
Bluehost, Inc.
1958 South 950 East
Provo, Utah 84606
I called the number listed on their web site, 888-401-4678
but they don't answer their phone - they use infinite hold as an interface tactic.
It looks like Bluehost is in the business of hosting INTERNET SCRAPERS, and is not responsive.
Their website provides NO EMAIL ADDRESS, and they will not answer the phone.
UPDATE: I have sent a Cease and Desist Order to their email address listed at whois
whois@bluehost.com
on January 29th, 2011 - at - 12:08 PM PDT.

Here it is in GIF format.
Cease and Desist Order




top of page

Page Ranking Factors Compared
The Facts Don't Count Anymore

Ranking Factor

My ORIGINAL
Blood Pressure Page
.
STOLEN
health-heart.net
Scraped Page

Domain Registration
Date from whois

2003-10-16
2010-12-05

BP Page Age
from Wayback Machine

7 years
1 month

Domain PR

5
0

Blood Pressure Page PR

4
- 1

Page Backlinks from
Yahoo Site Explorer

472
2

Page Backlinks from
Google Webmaster Tools

3,318
?

Page Backlinks from
Google Webmaster Tools
to the PDF version
of JUST the graphic

6,615**
na

Total Backlinks
for Site
Google Webmaster Tools

37,000 plus
?

Google Result -
Bottom Line

Page vaporized
# 1 SERP Position
# 1 SERP Position
Page vaporized

False canonization.
Obviously, Google indexing and search are broken.
The question is, for how long will they stay that way? 6 days

Both Bing and Yahoo give my page the #1 position in their SERPs, for the search phrase - blood pressure range. The problem was with Google exclusively.

** Up until Jan. 31, my only link to the graphic PDF page was from the main, complete html blood pressure page.

200 - 450

125

125




top of page

Loss of Traffic Summary
My page performance in the SERPs
Here is the result of my Blood Pressure page being -
Hijacked and
Canonized to Another Site by Google

Whereas, my page has come up #1 for the Google search - blood pressure range - FOR YEARS, my EXACT content (with a few omissions) is now being presented by Google as being the property of health-heart.net, IN THE #1 SPOT! My original, canonical page is gone.

How is it that Google is displaying MY CONTENT as belonging to health-heart.net?
This has been my most popular page, for over 6 years.

Anatomy of A Fix -
and Subsequent Demise - And Recovery

To illustrate my point with some data --->
Here is a summary of my recent Blood Pressure page popularity in Google -

42.2% of all my impressions - All of 2010

41.5% of all my impressions - January 3-10, 2011
41.4% of all my impressions - January 11-17, 2011
40.3% of all my impressions - January 18-24, 2011
36.7% of all my impressions - January 25, 2011

15.6% of all my impressions - January 26, 2011 - The BIG CRASH - New algo went live
12.5%
of all my impressions - January 27, 2011
  9.0%
of all my impressions - January 28, 2011
  9.4% of all my impressions - January 29, 2011 **
  9.1% of all my impressions - January 30, 2011 ***
  8.8% of all my impressions - January 31, 2011 (10 AM)***
11 AM - FIXED! Even I am seeing my site's page at the top again!
11.8% of all my impressions - January 31, 2011 (11 AM)
14.0% of all my impressions - January 31, 2011 (noon)
16.7% of all my impressions - January 31, 2011 (1 PM)
18.4% of all my impressions - January 31, 2011 (2 PM)
24.1% of all my impressions - January 31, 2011 (6 PM)
25.7% of all my impressions - January 31, 2011 (8 PM)
26.5% of all my impressions - January 31, 2011 (10 PM)

36.7% of all my impressions - February 1, 2011 - Normal
38.5% of all my impressions - February 2, 2011 - Normal
37.9% of all my impressions - February 3, 2011 - Normal
39.2% of all my impressions - February 4, 2011 - Normal
38.0% of all my impressions - February 5, 2011 - Normal
37.3% of all my impressions - February 6, 2011 - Normal
39.4% of all my impressions - February 7, 2011 - Normal
39.4%
of all my impressions - February 8, 2011 - Normal
31.6%
of all my impressions - February 9, 2011 - Normal for 9 days

17.3% of all my impressions - February 10, 2011 - PAGE DROPPED FROM INDEX
14.4% of all my impressions - February 11, 2011
15.4% of all my impressions - February 12, 2011 (7 AM)
24.0% of all my impressions - February 12, 2011 (9 PM) FIXED
. . .
40.0% of all my impressions - February 24, 2011 (9 PM) STILL FIXED - Yeah!

Was this caused by the content thief? Was this caused by the new Google algo?
STILL #1 on Bing and Yahoo.

** This small data reversal corresponds with concurrent online reports of MY ORIGINAL PAGE RECOVERY to the #1 position in the SERPs. I don't see it yet, but others do.
*** Whoops, false alarm - still holding steady at the bottom. No fix yet. I believe that most of these impressions are from Bing and Yahoo, where I STILL rank #1 for the search.

When Google fixed this, I saw an immediate improvement, as shown above.

Has the content ownership issue simply not been addressed adequately? I don't think it has. This data point should help contribute toward a fix.

I am reminded of the Seinfeld rental car episode -
"You know how to TAKE the reservation, you just don't know how to HOLD the reservation.
And that's really the most important part of the reservation - the holding. Anybody can just take 'em."
Google knows (knew) how to ESTABLISH the canon, but they don't know how to HOLD the canon. And that's really the most important part of the canon - the holding.




top of page

Other Theft of My Blood Pressure Page's Content

Jane at Google helped me out back in June 2008, with the theft of my MAIN BLOOD PRESSURE GRAPHIC, by several entities. I will now continue to document that incident here, over the next several days. This large graphic is arguably the most important part of the HTML page.

February 1, 2011
My ORIGINAL MAIN GRAPHIC is STILL being displayed by Google Image Search, by various offenders -
http://images.google.com/images?hl=en&q=Blood+Pressure+Chart

1. Thief #1 -
doddsfamilyfoundation.org (page 1 in image results)
If you click on the image, you are redirected to a blank page.

2. Thief #2 -
health-heart.net (page 3 in image results)

3. Thief #3 -
life.digitss.com (page 4 in image results)

4. Thief #4 -
fithealthyliving.com (page 4 in iamge results)

5. Thief #5 -
holistichealthhouston.com (page 4 in image results)

6. Thief #6 -
rajee.sulekha.com (page 5 in image results)

7. Thief #7 -
seanbanville.com (page 5 in image results)

8. Thief #8 -
wellsphere.com (page 6 in image results)

9. Thief #9 -
itsselvam.blogspot.com (page 6 in image results)

10. Thief #10 -
mylot.com (page 7 in image results)

11. Thief #11 -
funnrock.com (page 7 in image results)

12. Thief #12 -
desidieter.com (page 7 in image results)

13. Thief #13 -
fithealthyliving.com (page 8 in image results)

14. Thief #14 -
waystolowerbloodpressurefast.blogspot.com (page 11 in image results)


http://resources.alibaba.com/topic/39249/Human_Blood_Pressure_Range_Diagram.htm

ANYBODY GOT ANY IDEAS? What would YOU do?



top of page

Vaughn's Other
GOOGLE-SEO Related Pages

* Google Data Centers

*
Google Ranking Factors

*
Google Ranking Updates

*
Google Webmaster Forums

*
My Scraped Content

*
My Stolen Content
.




Disclaimer
This page was created for several reasons -
1. to EXPOSE the Internet scraper Health-heart.net, as well as the ENABLING host Bluehost.com
2. to address the de-canonization of my 7-year-old popular Blood Pressure page
3. to have a reference point for online discussion - to go fully open kimono
4. to make a strong statement to others who contemplate stealing my content
5. to set an example for other webmasters who may wish to take a similar tack
6. to maintain a PERMANENT record of ALL those who have stolen from me
7. To document my page's subsequent demise
8. and recovery.

Although the author makes every effort to verify the information on this page, no information on this page is guaranteed to be correct, and any data contained herein may be erroneous.
The opinions stated above are merely the personal observations and opinions of the author.

top of page

Tags: scraped content, stolen web page, health-heart.net scraper,
Bluehost hosts content thieves, DMCA



Vaughn Aubuchon Author Bio

Vaughn's Summaries
©2011, 2017 Vaughn Aubuchon
www.vaughns-1-pagers.com
All Rights Reserved
Site Map

This Vaughns Health-Heart Scraper summary
page was last updated on 2017-07-05.