Mark Cuban's Mahalo Wants Your Blood (And Gets it TOO!)

Mark Cuban recently talked about how search engines and content aggregators are vampires.

There is no reason to be indexed in Google. ... You haven’t gotten anything back

But he failed to disclose how his Mahalo investment loots content.

If Google is a vampire (while sending away billions of Dollars of traffic for free) then what does that make Mahalo, which borrows your titles and abstracts as content to pull search traffic into their ad cluttered pages, while placing your content below the fold and using nofollow on attribution links?

Is the following accurate?

If you think otherwise, then please explain. ;)

Danny Sullivan TORE UP Mark Cuban in a must read article which only Danny could have written. It is well worth a read for anyone who wants to understand the hypocrisy behind the Mahalo position on content scraping / vampiring.

Slow & Steady vs Hype, Crash & Burn

HOPE is perhaps the single most lucrative thing to sell.

There are so many people in need of direction, while so few actually want to do the work required to achieve the end goal. Thus many scammers sell the end result up front while glossing over the hard work required to get there.

I was over at a friend's house in the bathroom and saw a copy of "Your Infinite Power to be Rich" sitting on the floor & flipped open to a random page...page 102

THE UNIVERSAL BANK

A salesman needed an automobile for his new job, but he had no money with which to purchase one. However, he knew how to draw a check on his mental bank.

He told me that after he got the job, he went back to his room and formed the mental image of the car he wanted, with the positive certainty that it would be given him.
...
He struck up an acquaintance with another man in his apartment house who was going to Europe for six months and who said to him, "use my car until I return, and by that time you will be able to buy a car of your own."

If you ever read crap like the above please make sure to burn it, as it is useless.

Anything that requires you to close your eyes while listening to a marketer should make you assume they are preparing to work on another one of your orifices.

Many people who become rich are still unsatisfied by material possessions. And they often let the important things around them fall apart because they are too singularly focused.

Irrational Tweet From a Rich Man

A couple years ago a somewhat well known VC wanted to invest in us, but we had a bad gut feeling right from the get go.

Fast forward to December of 2009 and that same VC was Tweeting about a hate site he made for his wife, whom he is now going through the divorce process with. Not once, but something crazy like a half dozen times. And in between these Tweets he is Tweeting...

  • asking if anyone knows a bulldog divorce lawyer
  • about his new self published book which contains the word Peace in its title
  • how he needs some new executives for some projects

Who is the desired audience there? I mean, who wants to work with you after they know you will put up a hate site for your own wife, that you would be the type to sic bulldog lawyers on them, and that while you are doing so you are talking about Peace and are trying to recruit business partners ***in the same channel***?

Crazy irrational.

But that is what happens when people are emotionally charged and lack balance.

Greed is justified by more greed and nothing else matters.

But what is the end goal?

You can't take money to your grave*

*Even the king of pop's gold plated casket only cost $25,000.

The Caustic Effects of Get Rich Quick Marketers

Ryan Healy recently pulled back the curtains on many internet marketing gurus, the lawsuits amongst them, and the general damage they inflict onto the market. Fake retirements used to cloak legal restraining orders against certain business practices, not paying affiliates, credit cards shutting down payment processors, etc etc etc.

The people who sell the image of the perfect lifestyle to suckers are the exact same people doing business deals with "partners" in the court room and going through divorce...something so scary sounding that I couldn't imagine it.

If you are already drowning in cash, but can't be honest with yourself and your loved ones, then why the need for a few more Dollars? What will they buy? Some hookers and a few STDs?

The Big Banks Are Just as Bad

This sort of crap happens at all levels though. It is so ingrained that many people just assume that if you make a lot of money you must be criminal or doing something morally reprehensible.

And from an affiliate perspective, when you look at the market segments that pay the most it is often the seediest ones (or the ones that are propped up by systemic fraud). A few years ago the mortgage market would pay a lot for leads because that is where the systemic fraud was.

Exhibit A:
2004, CNN.com - FBI warns of mortgage fraud 'epidemic'

Exhibit B:

Imagine if you had key market insights and could trade on unreleased government information. A guaranteed source of easy profit exploited by some (especially when those people in government used to work for your company). And yet it is not enough. They need to steal more. There is a sickness in society that stems from our broken model of capitalism & materialism...where the central bankers flat out lie/cheat/steal to make even more money.

A nice take on it:

Despite the housing bust and financial crisis, very many of the people whose poor choices generated the housing bubble would make the same choices over again if circumstances repeat. Many industry participants, even those whose firms eventually went bust, were very well remunerated for their poor practices and, whatever their regrets, they kept the money. No one wants to create a catastrophe. But financial professionals want to remain free to make money in the ways that they know, and those are not good ways.

In July of 2007 former Citi CEO Charles Prince said, "As long as the music is playing, you’ve got to get up and dance. We’re still dancing."

And now that those people crashed the economy, the opportunity is to sell get rich quick at home doing nothing in your underwear overnight guaranteed. And there is money to be made in helping you fix your credit (since the above mentioned criminal elite class got a free pass while stealing your money and devaluing your savings while robbing the country blind).

Time vs Character

In an environment where such bubbles are core to the economy there is a lot of uncertainty. What is the best strategy? Who should I trust?

One of the best strategies I have found is simply time. Give a shady person enough time and they will reveal their character (divorcing their wife over money, etc). Granted I haven't always been perfect (and especially not when I was in the military), but you can't find many (any?) blog posts about me ripping someone off. Likewise with the people I look up to. Where is it shown that Seth Godin or Eric Janszen or Danny Sullivan took someone for a ride? Nowhere. In spite of a decade+ of experience.

Every day there is an opportunity to max short term revenues or long term staying power. Each choice and each interaction is somewhere on a continuum. Focus too much on short term revenues and a lot of the things that set you apart disappear.

One of the things Google does with their relevancy algorithms is to trust older and more established websites. You can fake a lot of things, but it is a bit harder (or more expensive) to fake age. And age is what sets apart a lot of the legitimate businesses from the above listed "entrepreneurs" who only need time to reveal their character.

Business vs Base Jumping?

Starting a business is a lot more like base jumping than it is just following a hope map. Most of the stuff you do won't work, but you only need to stay in one piece until you safely reach the ground. Sure you must have hope to get through the bad patches, but you also are forced to constantly improve to keep up with the market. Which is why a site called SEOBook.com sells an SEO training program (rather than an ebook) in 2010. ;)

Here is one of the secrets in plain sight that the opportunistic types rarely show you.

That was the growth of search volume last year. STILL over 20%!

Years ago my mentor told me that SEO is a marathon, not a sprint.

In the long run thin won't win, but (so long as you care) you can start off bad on low volume and get somewhere pretty quick when your field is growing at a 20%+ rate. And if you are new to the field you should be able to grow faster than the market because many business inputs have multiplier effects. You increase your growth geometrically as you

  • learn to optimize your traffic flow
  • increase your knowledge
  • increase the value of your knowledge
  • refine your strategy
  • refine your product or service
  • improve your conversion rates
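
To see why the inputs above compound rather than simply add up, here is a rough sketch in Python. Only the 20% market growth figure comes from this post. The individual gains (10%, 10%, 5%) and the `combined_multiplier` helper are made-up assumptions for illustration, not measurements from any real site.

```python
# Hypothetical numbers for illustration only.
MARKET_GROWTH = 0.20  # the 20%+ search volume growth mentioned in this post

# Assumed yearly improvement to a few business inputs (made-up values).
input_gains = {
    "traffic flow": 0.10,
    "conversion rate": 0.10,
    "product or service": 0.05,
}


def combined_multiplier(market_growth, gains):
    """Multiply the growth factors together instead of adding them."""
    multiplier = 1 + market_growth
    for gain in gains.values():
        multiplier *= 1 + gain
    return multiplier


yearly = combined_multiplier(MARKET_GROWTH, input_gains)
additive = 1 + MARKET_GROWTH + sum(input_gains.values())

print(f"Adding the gains up:      {additive:.3f}x per year")
print(f"Multiplying the gains:    {yearly:.3f}x per year")
print(f"Multiplied, over 3 years: {yearly ** 3:.2f}x")
```

The gap between the additive and the multiplied figure is small in year one, but it widens every year you keep improving, which is the whole case for slow and steady.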

Slow and steady isn't sexy. And it doesn't sell well.

But it works. :)

A further benefit to slow incremental growth is that as you grow your tribe and focus on their needs your customers become salesmen...helping attract more people just like them. And these are people who are pre-sold on what you have to offer, and why it is valuable. To a jaded audience testimonials from friends are far more valuable than sales copy. And almost everyone gets screwed into buying junk at least once before they find you.

Sales copy will likely push the quick returns no matter what (because that is what people want), but pay special attention to if someone is trying to use an aura of mystique on a brand new discovery as a marketing angle, while having little history. If they don't have much history the odds of buying a bag of smoke are much greater.

And if the recommendation comes with a big loud launch sequence then the chances of it being crap are even greater. And even if it starts out pure, the aim to "optimize" revenues at any cost often causes many partnerships to dissolve. Starting off slow and steady keeps things balanced and prepares you for growth.

Why Heavily Hyped Launches Are Often a Bad Idea

Wealth that comes quickly and easily often disappears the same way. Everyone has their hands in the cookie jar of success until the cookies are all gone.

When you are new to a market there are so many things to learn, refine and improve. Typically customers driven by hype are the most demanding (because an affiliate often oversells the product to get the commission) and the least qualified to succeed (since they want to rule the world in a day). The customers sold on a whole lot of hype and a whole lot of hope are basically set up to fail, trying to go too far too fast. They tend to buy on impulse, lack follow through, have a far higher rate of churn, a far higher rate of refund requests, and a far higher rate of chargebacks.

Further, every piece of a business can be optimized - from choosing who you want your customer to be, to who you don't want it to be, to what types of interactions to build, to what prices to charge, to balancing time spent on servicing customers vs growth, etc etc etc ... right on through to managing your personal load and fixing programming bugs (when we first launched our membership site the programmer made it such that if a person canceled they couldn't rejoin (even if the cancelation was due to an expired or stolen credit card))! But if you keep accepting feedback and incrementally improving you prepare yourself for heavy load by the time you build it.

Whereas a pull the cord and hope this works approach with lots of hype will almost always lead to disappointment and frustration. You probably want to test the equipment before jumping off the mountain :D

Even if the claims about non-payment in this lawsuit are NOT correct, it doesn't make a business look professional if they have publicly accessible lawsuits claiming that the fulfillment partner of choice is incompetent.

Long after the launches, details leak out and experiences are shared. And that is what builds your reputation...good or bad. The slow and steady growth model offers time to fix errors and refine strategy. The launch and hope model doesn't.

More PROOF Jason Calacanis is a ____

Publicly Jason claims to be ignorant about SEO because it allows him moral flexibility and makes Google less likely to torch his site (even though he is blatantly violating their search quality guidelines, and has for *years*).

But when you look at the sales material that Mahalo pitches to corporations, the 19 page PDF reads like an à la carte menu of SEO services, rather than sales material from a company ignorant of SEO.

It includes a slide which highlights how well Mahalo Answers questions rank in Google titled "SEO value," as well as the following statements (followed by my comments):

  • Questions are imported from Partners’ Answers Community into Mahalo Answers, enabling 100% share of voice and high SEO value. (filling Google with duplicate content)
  • Category Selection Based on Keyword Intelligence and Customer Goals (doing keyword research, an SEO service)
  • Community seeded with high-value questions and answers (does the word "seeded" mean asking fake questions?)
  • By carefully policing the site, Mahalo keeps out inappropriate content, thus increasing engagement and utility. (no mention of the half million+ pages indexed in Google which contained scraped 3rd party content?)
  • We can help our partners increase their search engine rankings with these high quality pages. (that is the actual text from their slide titled "HowTo")
  • Mahalo’s team of editors will find the most highly-trafficked search terms and keywords for your brand, industry or product and build corresponding high-quality pages that will rank well. (isn't that exactly what "scummy" SEO companies do Jason?)

Given that Mahalo is now branded as an SEO play (in their own words), and that they scrape millions of content listings to publish on their pages, are creating tons of other duplicate content, have actively engaged in link farming, and are not above "seeding" questions based on keyword value, why should Google trust *any* of their business practices going forward? Especially when their SEO services enterprise was launched on the back of calling SEOs scumbags.

How can the Google web spam team members look themselves in the mirror each morning hunting smaller webmasters and ignoring operations like Mahalo? It must begin to feel arbitrary at some point, no?

Why Mahalo (and Other Content Scrapers) Render Google's Spam Team Flaccid

I was talking to a friend yesterday who was at a conference where Demand Media's CEO spoke, and he stated that nobody asked the big question: "what if Google decides they don't like you anymore?"

Then I got thinking about how Google torched Squidoo after Jason Calacanis went on his public campaign to rebrand it as spam. But today under the same level of scrutiny, how is Mahalo (which scrapes millions of 3rd party content listings *without any editorial filter*) not spam? Squidoo at least donates $10,000 a month to charity. Mahalo just "borrows" your content without permission and keeps all the cash.

In the past Google hated content scrapers pretty bad. How bad? Well a guy named Teeceo used to make scraper sites, and here is how Matt Cutts described his work:

In the chat room, I said hello to teeceo, but I know the stuff that he was doing and it’s shoot-on-sight. I think anyone who is blackhat knows (or should know) that I’m happy to talk to anyone, but that we’ll still take action on the spam we find.

Imagine taking that approach to hunting search spam all day long, and then ignoring the *fact* that Mahalo is scraping millions of third party listings and using them as content with no editorial filtering.

Then I started thinking about why the Google spam team could ignore something as outrageous as Mahalo, especially when it was built by a guy who was a false anti-spam evangelist. Is it because Jason is a good guy? No. Is it because there is some actual editorial vetting of the content? No. Is it because Google is getting a cut of the AdSense revenues? Google doesn't need the short term cash flow (look at all the affiliate AdWords advertisers they just torched), so that is too cynical of a view.

Yes Google wants display inventory (their biggest opportunity for 2010 according to the quarterly call), and these "content" websites have already given themselves over to Google as inventory. But it must be something deeper than that. So I started thinking about it from a longterm strategic level...

Google won't penalize sites like Mahalo (even though they blatantly violate Google's guidelines) because Google *wants* to use the works of companies like Mahalo, Demand Media, and Aol to lower the value of other content and bankrupt a lot of the traditional media companies.

Why would Google want to do that?

There is excessive duplication in the marketplace. The faster that duplication is driven out of the marketplace the more desperate companies will be to cut deals with Google. And while there is a down market Google can drive companies out of the market and just claim that it was the economy that did it (much like how Mahalo used the down economy as an excuse to fire most of their editorial staff and replace them with content scraping robots).

Once a lot of media companies are bankrupted, the market is far more efficient, and there are fewer mouths to feed, that means Google can squeeze greater profit margins out of the media ecosystem by getting a fatter cut of the ad revenue.

Currently this shift is risk free because almost nobody understands how the marketplace works. Sure Paul Kedrosky and Mike Arrington blogged about the search results getting spammier, but until you frequently read the above listed sequence on sites outside of the SEO industry there is no damage to the Google brand in them turning the internet into a cesspool.

Once it starts harming the Google brand then I expect them to act quickly and decisively. And sites like Mahalo will see a sharp drop in traffic. Jason better milk it while he can. The clock is ticking.

Starving Artists in the Age of Cesspool Content

On Hacker News, Melvin, from Web Design Company, had a great analogy on the Mahalo business model:

Let's use a different industry to illustrate what is happening.

Let's say a band named The Beatles records a new album. The local radio station gets a copy of their album and plays their song. The listeners love it so they play it more often, but they don't mention who the band is and on their website, they put up a link to download the song... but without any credits. Their audience grows. They get advertisers to advertise to their audience. They say, "hey, playing good songs gets us more listeners and more listeners gets us more advertisers, which gets us more $$. Let's do this more often." So they go do this 500,000 times, and each time never mentioning who the artist is. They grow and prosper while the artists starve.

Oh, in the mean time they call the artist scum.

In the above metaphor, the artists are the bloggers whose content Mahalo is using. The radio station ripping off the artist is Mahalo. The Federal Communication Commission is like Google, who is allowing all this to continue because the radio station is giving them a cut from the advertising revenue.

Hope this helps make it a little more clear why what they are doing is wrong, needed to get exposed and needs to get fixed.

The analogy isn't 100% perfect...but it *is* pretty darn close. :D

Jason is not 100% Jim McCormick, but he isn't 0% either.

I Turned the Google Toolbar Off, But It Kept Spying On Me...

Ben Edelman: "Although I had asked that the Google Toolbar be "disable[d]," and although the Google Toolbar disappeared from view, my network monitor revealed that Google Toolbar continued to transmit my browsing to its toolbarqueries.google.com server"

Google AdWords Tax Calculator

Many experienced advertisers realize that there are many gotchas in the AdWords system...optimization tools and default settings which optimize to boost Google's yield at the expense of unsuspecting advertisers, who don't yet know what match types are or that their ads are syndicated to content sites by default.

To help new advertisers get past many of the gotchas we created the Google AdWords tax calculator - a free utility which highlights many stumbling blocks that catch new AdWords advertisers.

AdWords tax calculator.

Given that each keyword market is unique it would be impossible to make a tool that was 100% accurate in every situation, but the goal of this tool was to simply highlight common issues, and help new advertisers address them. Individual efficiency gains may be greater or smaller than the rough initial estimates the tool provides.

Please let us know what you think, as we will gladly iterate this calculator to make it better if you have some great ideas you think we should include in it. Like all of Google's products, our calculator is starting out in beta :D

Mahalo Autogenerating Spam Pages Targeting Google

Does Google like auto-generated websites wrapped in Google AdSense ads?

The short answer is no.

The long answer is a bit more convoluted. But so long as they are...

  • well branded
  • well funded
  • operating at scale
  • good at public relations

...the answer is yes, autogenerated websites full of scraped content are fine.*

*based on Mahalo.com

Mahalo SEO Spam Case Study

The Sales Pitch & Launch

Originally when launching Mahalo, Jason Calacanis claimed that it would be spam free and that SEOs would have hell to pay.

He had a multi-month sales pitch leading up to the launch of his site where he kept stating that Squidoo is spam and kept calling SEOs scumbags so he could pull in attention and links. This was well received by SEO conference organizers because people would talk about how outrageous Jason's speech was online, so (seeking marketing for their conferences) the SEO conference organizers acted like lap dogs standing in line waiting for their turn to have Jason call their paying attendees scumbags.

The publicity strategy worked great as it helped land Jason some mainstream press coverage and a lot of ditto head bloggers (who lacked either the experience or the mental faculty needed to see the bigger picture) got behind Jason.

The Wikipedia page about Mahalo reflects the public relations driven misinformed pitch:

Search results quality

Mahalo's goal is to improve search results by eliminating search spam from low-quality websites, such as those that have excessive advertising, distribute malware, or engage in phishing scams. Webmasters have a vested interest in seeing their sites listed. Calacanis has said that algorithmic search engines, like Google and Yahoo, suffer from manipulation by search engine optimization practitioners. Mahalo's reliance on human editors is intended to avoid this problem, producing search results that are more relevant to the user.

When people steal/borrow/syndicate content without any editorial value add or original content, and then wrap it in ads, that is generally considered spam. We will come back to that topic later, I promise! ;)

Early Media Success

Around the above conversation flowed a bunch of links, which helped Mahalo get off to a fast start. At first Jason claimed he wanted to create "the best" content for the most popular search queries. Many members of the media were duped by Jason's misinformation, as is well reflected in the CNET article titled Jason Calacanis' Mahalo: Screw the long tail:

Instead of a server farm that crawls through the entire known Web so it can automatically match Web pages to the queries you type, Mahalo's search results are created by humans, in anticipation of the queries its users will type in.

How can this possibly work? Because, Calacanis says, the top 10,000 search terms account for 24 percent of all searches. If you can create great results for the top results, users will learn to appreciate the difference between machine search results--which are often thrown off by spam and poor-quality links--and human-powered search pages, lovingly created by caring search editors. For the obscure "long tail" queries that make up the 76 percent of search terms, Mahalo will serve up Google results.

Their first x articles were typically thin link lists, but hand generated. But since the pages were just link lists they were not remarkable enough to be linkworthy and the service was not sticky enough to keep people coming back. So Mahalo also decided to ramp up link building & awareness using 4 strategies:

A person who claims to have worked for Mahalo named Matthew Wayne Selznick wrote:

Regarding the Mahalo Blog Network: I don't know how recent that screenshot is, but it's amusing to see the blogs of several people who have either left the company or were laid off last October, when half the in-house editorial staff (including myself) was purged.
...
When I was working for Mahalo, staff were strongly encouraged to get blogs if we didn't have them and blog about Mahalo whenever there was a high-traffic opportunity like an awards show, sports or political event.
...
I unsubscribe from the blogs of my former co-workers when the majority of their posts are Mahalo link parades, just as I unsubscribe from any blog when it becomes a mouthpiece.

Their content was not Pulitzer prize level, but the strategy paid off and they started pulling in search traffic.

Strategy Shift

In spite of Jason claiming that he just wanted to dominate the short head of search volume, that is not how Mahalo started gaining search traffic. Even if they poured hundreds of Dollars into a piece of content, generalist content with little to no topical expertise could not compete for the most competitive and highest traffic search keywords.

You need to have something useful or original to add to the conversation if you want to compete for the most competitive keywords, and penny pinching outsourced content doesn't get the job done there.

Instead what happened was that they ranked almost instantly for keywords like "best computer speakers" even with low quality scraped content.

Around the time I highlighted the emergence of that strategy, Google's Matt Cutts was interviewed about it and claimed that it was fine because Jason Calacanis was using MediaWiki to create his site. Jason also did a bit of damage control in a Sphinn comment where he claimed the spam pages were "experimental pages" that "we are no indexing"

In his own words:

That was 671 days ago. What has happened since?

A Prediction

Around the time of the above incident John Andrews (who gets the SEO field as well as anyone does) stated:

Everyone just copy Jason Calcanis and Mahaloo, ok? That sounds like a GREAT idea. Jason dissed SEOs in public, at a keynote, on purpose, and then learned a bit so he wasn’t quite so ignorant of SEO any more, and is now working the SERPs as a black hat SEO. Jason dissed affiliates in public, at a keynore, on purpose, and then learned a bit so he’s not as ignorant of affiliate marketing as he was before, and now Mahaoloo has embedded (inline) affiliate links (take a look.. added since Affiliate Summit). I think every "Learn how to Make Money Fast on the Internets" web site should simply point to Mahaoulo and say "copy them.. they are riding the black edge of gray hat SEO" and be done with it. So simple... just copy them. As they add pages, add splogs on those same topics because those are money terms. Every time they link to some resource, link to it from that blog. Scan technorati for Jason’s comments, and add one of your own right into that thread.. every time. Let Jason pave the way to profits.... each time he justifies his spam, he’s justified YOUR spam as well. Every time he explains how he’s not a spammer, he’s explaining why YOUR not a spammer either. Best of all, he’s being your spokesperson for FREE!

Was John Andrews once again correct? Let's take a look behind the curtains :D

What Happened?

Well the above computer speakers page that was highlighted still ranks in the top 5 search results in Google.

And the site has been growing quickly, with traffic increasing at least 3-fold over the past couple years.

Jason used the economic downturn as a convenient excuse to fire most of their editorial staff. But a big piece of that traffic growth is that they have got more sophisticated in their content scraping strategy.

To appreciate how reliant their model is on scraping content, I want you to see how a new page starts off.

Once you strip the ads and scraped content from that page there is nothing left but branding & navigation.

Two other noteworthy things about that page are that it was generated by a robot (see below) and that it is already indexed in Google. Once you have enough domain authority you can publish automated scraped garbage and rank well in Google. It is the Mahalo strategy.

That page (which was automatically generated in under a minute by a fake user robot named searchclick) is already ranking well in Google! How do you know searchclick is a fake user? Well look through all the different pages they created in under a minute over the course of the last year...likely 10,000's of them.

Understanding the Insidious Nature of Mahalo's Scraping

Search engines like Google scrape content so that they may provide a service of value to end users *and* publishers. When they make your snippets they are used to help promote your website.

What Mahalo does is take snippets, and publish them as content on their site. So they use your page titles and your content snippet to rank their site using your content, without your permission.

If you optimize your page titles on a new blog post you are helping to feed relevant optimized content into the Mahalo machine. They will scrape it, and if you are less authoritative than they are, they will likely outrank you!

To add further insult to injury, they put nofollow on links back to the content source which they are scraping content from, so while they are "borrowing" your content you are not getting any link credit for it.
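
If you want to check whether a page that lists your content links back with nofollow, a rough sketch along these lines would do it, using only Python's standard library. The `LinkAudit` class and the sample markup are hypothetical: the HTML is made up to stand in for a scraped listing and is not copied from any real Mahalo page.

```python
from html.parser import HTMLParser


class LinkAudit(HTMLParser):
    """Collect (href, is_nofollow) for every <a href=...> tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr_map = dict(attrs)
        href = attr_map.get("href")
        if href is None:
            return
        # rel can hold several space-separated tokens, e.g. "nofollow noopener".
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        self.links.append((href, "nofollow" in rel_tokens))


# Made-up markup standing in for a scraped listing (an assumption, not real data).
SAMPLE_HTML = """
<div class="listing">
  <a href="http://example.com/your-post" rel="nofollow">Your Optimized Post Title</a>
  <p>The snippet from your post.</p>
  <a href="/internal-page">More on this topic</a>
</div>
"""

audit = LinkAudit()
audit.feed(SAMPLE_HTML)
for href, nofollow in audit.links:
    print(f"{href}: {'nofollow' if nofollow else 'followed'}")
```

In the made-up sample, the attribution link to the source carries nofollow while the internal link does not, which is the pattern described above.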

And It Gets Worse!!!

As abusive and as extreme as the above sounds, it is actually only the first step in the process.

What happens next is that if your content (published on Mahalo without permission) causes the Mahalo page to rank for new valuable keywords then they may feed those keywords into their page generation tool and keep making more auto-generated pages in that area, leveraging their domain authority and YOUR content to compete against you while building an automated spam empire.

Some of the top earning pages might have freelancers thicken them out, but the only reason humans are involved at that stage is to legitimize the mass content scraping farm that is the base of the operation. If a company has 200,000+ automated pages with 0 overhead that make 5 cents/day each that is real cashflow - $10,000+ per day of profit!
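
The back-of-the-envelope math above, spelled out as a quick Python sketch. The page count and the 5 cents/day figure are the round numbers from the paragraph above, and the zero-overhead and flat-earnings assumptions are simplifications, not reported financials.

```python
# Round figures from the paragraph above; not reported financials.
AUTOMATED_PAGES = 200_000
CENTS_PER_PAGE_PER_DAY = 5

# Work in whole cents to avoid floating point rounding.
daily_cents = AUTOMATED_PAGES * CENTS_PER_PAGE_PER_DAY
daily_dollars = daily_cents // 100

# Assumes earnings stay flat for a full year (a simplification).
yearly_dollars = daily_dollars * 365

print(f"${daily_dollars:,} per day")
print(f"${yearly_dollars:,} per year")
```

Tiny per-page earnings only look tiny until you multiply them by the number of pages a robot can publish.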

Still not convinced of the profit potential? Mahalo.com has ~ 300,000 pages indexed in Google. On auto-generated pages it is far easier to get people to click an AdSense ad than it is to get them to buy something from Amazon.com (and you profit on 100% of the ad clicks vs only 1% of the Amazon.com clicks that convert). While there are 4 AdSense blocks *above* the Amazon.com affiliate links, Jason did $250,000 on Amazon's affiliate program last year "without trying" (again, his own stats in his own words...see Flickr.com/photos/jasoncalacanis/4234615626/ ).

Putting it All Together

If you build link equity and are good at public relations you can get away with murder in Google. Scale it big enough and the guidelines simply do NOT apply to you.

Most people who try to "pull a Mahalo" and spam up Google will likely fail because they lack

  • the public relations & affiliations needed to attempt to legitimize such a strategy
  • the willingness to lie just to get a bit of media ink
  • the public relations & media savvy to pull such a major bait and switch without getting caught
  • the domain authority to make it work algorithmically

Originally when launching Mahalo, Jason Calacanis claimed that it would be spam free and that SEOs would have hell to pay. Now that he is scraping your content (and adding nofollow to the links to your content) I think he is right. You are losing out on your search traffic because an authority site is "borrowing" your content and outranking you with your own content.

Jason got Squidoo penalized by calling it spam. Under the same level of scrutiny, how is Mahalo, which scrapes millions of 3rd party content listings *without any editorial filter*, not spam? Squidoo at least donates $10,000 a month to charity. Mahalo just steals your content without permission and keeps all the cash.

Are the search results going to start filling up with Twitter recycling start ups? What happens when the media gets in on this "what the bloggers have to say" scraping game? Does it even matter who created the content so long as someone wraps it in ads & ranks it?

I don't think we can stop people from being greedy or stealing, but I am surprised Google has turned a blind eye to this process. Is this what they want the web to become?

Open Site Explorer is Pretty Slick

From a marketing and a public relations standpoint this tool is brilliant. Rand just put up a full in-depth review here.

In the past I have not been a fan of certain outing policies, but of late I have seen that practice has gone away...and if it stays that way how could I not recommend the above tool?

Sometimes it is hard to appreciate how spoiled we are as SEOs with cheap to free keyword data, cheap to free great link data, and lots of useful tools to help us organize and make sense of it all. And even lots of charts :)

One area where some of our tools could be better is on the usability front...we tend to presume some level of knowledge and/or the willingness to work through things to figure them out, but the presentation on OSE is very easy to grasp & understand at first glance. Part of this challenge comes from limited resources...and the most limiting one being time. It is so hard to make money servicing the SEO market (because there are so many great free options). As the market continues to open up more with more tools and options, at the same time the SEO *process* keeps getting more complex, with more competitors jumping into the SEO market.

It certainly feels like it will keep getting easier to make money as a publisher rather than a person servicing the SEO market.

But not all forms of publishing will get easier & more profitable. Companies like Demand Media and Aol sharing their results publicly will saturate some segments, but there are many areas where bullshit content won't be enough to compete. And some thin operations will see margin contraction as the investment needed to stay competitive increases.

But we are quite literally drowning in opportunity. If a person can't make money as a publisher with SEO knowledge, it is simply because they are not willing to put in the effort (or they are part of an old bureaucratic publishing company which moves slowly, is debt laden, and has a high cost structure).

I have always avoided scaling as a company, but there is so much opportunity that I might have a resolution for 2010 :D

eBay SEO: an Interview of Dennis Goedegebuure

Dennis Goedegebuure, aka DennisG, is the head of eBay's in-house SEO team. After seeing him make some great posts in our forums and chatting a bit back and forth I asked if he would be up for doing an interview about SEO. And the result is the following 12 pages full of great actionable tips for anyone looking to learn more about in-house SEO best practices. Thanks to Joost for introducing us.

What is your background and how did you get into SEO?

After I finished a master’s in Economics at the University of Amsterdam, I started at eBay in April 2002 in the Internet marketing team for eBay Netherlands and Belgium. Just 5 months before, eBay had acquired iBazar, the European clone active in a large number of European countries. iBazar relied heavily on traditional ways of marketing like TV, radio and print. eBay invests the majority of its marketing budget in direct acquisition of customers through Internet marketing. So I was hired as the second employee in the IM team.

During my tenure at eBay.nl, I worked at direct portal relationships like Yahoo & MSN, did some early paid search keyword buying with Google when they entered the Dutch market, and worked on the acquisition of the largest classifieds site in The Netherlands, marktplaats.nl.

To become better at communicating with our local developers, I started to learn coding languages. HTML was obviously the first one, and definitely a must-have skill to become more effective at explaining what I needed from the developers in the projects.

However, your skills in HTML become rusty very fast if you don’t use them, so I started coding my own websites. As we live and breathe data as Internet marketers, I was definitely intrigued by the potential of SEO as a traffic source. Since I didn’t want to invest money yet in these sites, I only had time to invest to drive more traffic to my sites. SEO seemed a good way to get more traffic.

Within eBay you have the ability to control your own destiny if you take action. If you would like to move to another job, you can work your way into it. After the acquisition of Marktplaats.nl, I took a broader role in SEO for that site, as well as the Natural Search projects for eBay.nl. Marktplaats has its own development team, which is not the case for eBay.nl; eBay.nl is on the global platform, where the majority of the product releases are driven out of the San Jose product teams.

In 2004 I was invited for a trip to eBay Marketing College, where I met my future manager in the US. A year later I got the chance to move to San Jose, in a job coordinating the global Natural Search projects. At the time we had local teams working on Natural Search, and there was a big need for best practice sharing and coordination of the global projects.

Now, 2009, I’m working in a centralized team in San Jose, where we are responsible for the Natural Search traffic for all eBay global sites. We consult on the Classifieds sites and on PayPal where needed. And we have very good relations with the in-house SEO teams of the Classifieds group, Shopping.com and Stubhub.

Over the last three years, I have consulted on SEO with Skype, StumbleUpon, Rent and half.com. It has been fun to see the different challenges and the different solutions the teams bring to the objectives they set for the SEO projects. And I learned a lot about SEO and scalability.

A large number of people have shaped my thinking about SEO. Among them are well-known names in the SEO industry like yourself, Danny Sullivan, Vanessa Fox and Michael Gray. Thank you all for sharing the wealth of knowledge!

One particular colleague who has made a lasting impact on how I work is Alex Schultz. Alex is just an incredibly smart guy, with amazingly diverse background knowledge in Internet marketing. I would work with him again in any team, at any time!

You do SEO for one of the largest online websites and yet you also run a few of your own websites. How would you compare the differences between your enterprise level efforts and what the average SEO experiences working on smaller websites?

I use my own websites to test small tweaks or new techniques in the broadest definition of Internet Marketing. I’m learning every day from other people online. It’s important to make sure you are not focused on one traffic source too much, and not to become too specialized.

On large scale, enterprise websites it’s extremely important to think about the long term impact of certain changes. A site like eBay is like an oil tanker at sea. Where you can make fast changes on your smaller website, which can be easily rolled back, on a large site like eBay, the product roll out process is much more complex. As eBay has been a large target for phishing in the past, a great number of extra security checks are required.

For enterprise websites you would need an additional skill set to be more effective. Where on smaller websites you can rely on getting your requirements in by using your technical skills when talking with developers, in the larger organization you would need to manage projects and resource allocation through other managers. Those managers might have different incentives or maybe even a different political agenda. Getting your work done in that environment requires the in-house SEO to have a lot of persistence and patience.

What are some of the things you have done which you have found to be most beneficial in helping to evangelize SEO and get buy in from other managers?

Sometimes it pays off to get somebody from the outside who can call out all the things that go wrong from an SEO perspective. As building connections with the rest of the organization is essential for the success of your future partnerships within the organization, you can hardly flame all the SEO misses in front of a large audience yourself.

I’ve done this a number of times and had some good success getting the attention SEO needs. It may have helped that I got some senior product folks into the session who have become the biggest SEO ambassadors in the company.

Having these senior folks helping you can catapult your career as well. As an in-house SEO you may find yourself in between different departments. Having a sponsor in each and every department will help raise your profile among more senior people, who can help you in your next projects, career moves or just with advice on how to deal with complex problems.

You mentioned that people should not be too focused on any 1 traffic source online. What are some of the best things smaller businesses can do to help lessen their reliance on search? What types of businesses & products work best with leveraging eBay as a source of customers?

Link building in the broadest form. Even no-follow links will help any small business to grow in traffic. We as SEOs are so focused on the link as a means to improve rankings that we have forgotten the real function of a link. A link is “linking” two documents to each other for easy navigation by the user.

Links are good for generating traffic. Getting more links to your pages/site will generate more traffic. Early this year I gained a link from Valleywag to my blog. Looking back at 2009, this single link was the second-largest source of traffic to my site!

Furthermore, think about StumbleUpon. StumbleUpon can still drive a significant amount of good traffic to your site, as long as your pages are tagged in the right category in SU. I’ve sent the post from Darren Rowse, Why StumbleUpon Sends more Traffic Than Digg, to a number of starting entrepreneurs. Also, Brent Csutoras had a more recent post this year on how StumbleUpon is one of his major sources of traffic. Read for yourself at: The Stumble Effect: StumbleUpon Hits the Big Leagues.

“StumbleUpon is the gift that keeps on giving,” I always say. One of my sites gets hit almost once a month with a peak of traffic from SU (see picture below). This can be a great way of lowering the reliance of your site on search as the main source of traffic.

As an SEO consultant and blogger writing about the latest changes in SEO, I (and other folks) sometimes try to figure out how algorithmic shifts might play out. From your experiences with eBay, do some of us bloggers tend to over-emphasize what might happen? Due to the gravity and strength of your network of websites, does eBay end up seeing far less volatility than smaller sites end up seeing?

I don’t think SEO bloggers over-emphasize what could happen. It’s just that you might draw other conclusions from what you see happening in the rankings or traffic to your sites than others do. Each site reacts differently to algorithmic changes, as each site has a different link profile, content focus or site architecture.

All you can do is report what you see happening during an algorithmic shift. What I would encourage SEO bloggers to do is ask more questions for their readers to respond to. What you see might be different from what others see. Learning from the responses might give you new insight.

When it comes to seismic shifts in the algorithm, we don’t see that much volatility in traffic. Where one page type might lose, another one can gain. The same with keywords, where we might get more traffic on branded searches, there might be a loss in generic product name searches. The larger the site, the less volatile the site can be to algorithmic shifts.

Now, having said that, it’s still difficult to see what the real impact is on a site like eBay from changes like Vince or Caffeine. We will see when we all get caffeinated, but given that eBay has invested in site speed, has relevant content for online shoppers looking for great deals and is growing in number of items for sale, I expect eBay to do well getting more visitors through Natural Search.

What are some of the easiest things to mess up when working on a site of that scale?

Working on a large scale website will usually mean different teams are working on different parts of the site. These teams will have people leaving, and new people joining. Without proper best practice sharing or historical SEO learnings in place, you will find yourself running behind every project in flight to get your requirements in.

As an in-house SEO team, we promote the team on a continuous basis, build new connections as people come in, scan for projects that might become critical for our success and go after the owners of these. I would say the easiest way to mess up traffic when working on a large scale site is losing overview of what the organization is working on, the key objectives for the organization and how SEO can drive/contribute to the overall objectives. Being plugged in is key to staying on top of everything. Here it comes down to what we call in Europe: Fingerspitzengefühl.

After that, all technical changes are a matter of prioritization and resourcing. Based on our assumptions we need to show the possible downside or the upside on any of the tradeoffs that are made.

Have you ever had any happy accidents where someone changed stuff without mentioning it, causing an increase in traffic?

Yes, just recently a renewed internal focus on site speed has also shown some good increases of traffic in Natural Search. I was aware of the renewed focus, where I actually kicked off some of the discussions back in 2007. Now that site speed is becoming more important as a ranking factor, the projects to enhance the speed of the eBay pages might pay off more in 2010.

What all success metrics do you look at when evaluating general changes to a site of that scale?

Traffic. Traffic and conversions.

I don’t believe rankings will tell you a whole lot, as they vary too much across data centers, personal search or location based on IP targeting. Rankings can only be directional, not actionable. At eBay, the majority of traffic is on long tail keywords. The number of keywords that we are getting traffic on is so large that we would hardly be able to track any of the positions. So I sometimes do some rank checking with your rank checker, but only from home, not from the corporate IP address. But with rankings comes traffic. So even if rankings are not a leading indicator of your success, rankings will produce the traffic which is your objective.

Estimating the traffic impact of any changes on a small site is difficult, but you can easily manage the risk by rolling back any of the changes. On a large scale site, it’s much more difficult to roll back any changes in infrastructure. Even test results on my own site generally will not be a good proxy of the impact similar changes will have on the larger eBay sites.

This is where search engine guidelines and user experience will come in. Taking the long term strategic approach, we don’t want to lose rankings and we don’t want to lose traffic. What is good for our users, most of the time will be good for search engine rankings.

You mentioned that a lot of your traffic comes from longtail organic search. Across the search marketing field as a whole, there is an amazing budget gap between paid search and organic SEO, where organic SEO offers higher returns but is typically done on a small fraction of the budget. As a marketing investment, why do you feel SEO has lagged paid search? Have you noticed competing businesses shifting more resources into SEO lately, or is it still way behind? What might cause further SEO investments at companies large & small?

I strongly believe the gap between the investments in paid search and SEO is caused by the direct response effect of Paid Search. As a business it’s easier to predict how sales will react tomorrow to the dollar invested today.

For SEO, it’s always hard to predict the outcome of any investment. I actually struggle a lot with this internally. We have to compete with other teams over product resources for the core site development. If another team has come up with a new seller feature which they predict to increase revenue by a couple of million dollars, it’s hard for me to secure any of the resources based on a competing revenue estimate which might have a lower accuracy level.

With the economic downturn earlier in 2009, paid search budgets have seen a decline. You could see that in the growth numbers from Google, where “only” a 3% Y-Y growth in the second quarter was realized. However, I felt the increase in investments in SEO across the board. I got more job offers and headhunter calls than the years before. Also, it was more difficult to fill open positions in our team, where in-house SEO people were in higher demand.

The more small & large companies become aware of the power of SEO, good rankings, long tail keyword traffic and search based user behavior, the more companies will start to invest in SEO. This sounds like nothing new compared to what we have seen in the last couple of years, but in the coming years, the space will become even more crowded. There are only 10 first page results, and sometimes not even 10!

Some small companies will use consultants for some good advice and fall back on DIY implementation; larger companies will probably want to employ a full time in-house SEO person/team. You can see this trend clearly with the rise of specialized streams at the SEO conferences focused on in-house SEO.

When you run a site that large, is there any easy way to phase in tests while minimizing risks?

No. As product life cycles are fairly long compared to other, smaller websites, there is less opportunity to test on the core site. And even if you can run a test, we have to keep in mind that more than 1.5 million people rely on their eBay sales for their primary source of income. We service these people to make sure they are successful. Driving traffic to their items for sale is our most important objective.

Now, that does not mean we don’t do tests at all. We have a number of initiatives where we test, and luckily I have a VP who used to run the Natural Search channel. He understands how important testing is. We get a lot of freedom to deploy smaller initiatives off the core platform to do some testing. Actually these test projects are paying for themselves, as the revenue derived from the test sites outweighs the costs in the long run.

One example of our test projects, the New-Pulse (currently we are having some smaller issues with the cronjobs, will be fixed soon) was a way to tap into the wisdom of the crowds of successful bloggers. My intention for the project was to have blogs like Gizmodo and Engadget do what they do best: bring the newest gadgets to their readers, while we analyze what products will become winners. I published about the project here, after I got questions about how it worked at the Jane&Robot session in San Francisco. This particular project gave me a lot of new ideas about what I can do with our internal data, and how to leverage the broader data streams that you can find all over the net.

Small anecdote; based on the insights from the New Pulse, I found out there is an active knitting community who knit socks during the months of October, calling it Socktoberfest. Pictures of the socks are being shared on Flickr. Here you can see how I picked up this trend.

When you guys implement strategic changes does traffic sometimes do a head fake and go the wrong way before moving the direction you expect it to?

It depends on what you mean by strategic changes. If we change the focus towards a certain category, traffic might increase immediately because of the higher exposure that category gets through our PR efforts. If strategic changes mean product changes, the traffic can be impacted to a large extent.

This is why the Natural Search team at eBay has been growing for the last couple of months. There are so many product changes and projects initiated, that we will need to leverage the product teams as much as possible.

To answer your question, yes, we have seen this happening. Over 2009, we introduced a number of site-wide changes. For a long time, eBay has been known for its notorious URL structures. The usage of “maverick” URLs was causing more pain than it did any good. Removing the double encoded parameters from the listing URLs, and introducing the canonical URL tag, caused an initial drop, after which traffic stabilized again at prior levels. However, the expectation is to see traffic increase over the coming months because of fewer duplicate URLs for the same page.
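
As a rough illustration of the kind of cleanup described above (this is not eBay's actual code), the sketch below collapses several URL variants of one page into a single canonical URL and prints the matching canonical tag. The parameter names in `CONTENT_PARAMS`, the `example.com` URLs, and the helper names are all assumptions made up for the example.

```python
from html import escape
from urllib.parse import parse_qsl, unquote, urlencode, urlsplit, urlunsplit

# Hypothetical: the only query parameters that change what the page shows.
# Anything else (session ids, tracking codes) is dropped.
CONTENT_PARAMS = {"item", "cat"}


def fully_decode(value, max_rounds=3):
    """Undo repeated percent-encoding, e.g. 'home%2520garden' -> 'home garden'."""
    for _ in range(max_rounds):
        decoded = unquote(value)
        if decoded == value:
            break
        value = decoded
    return value


def canonical_url(url):
    """Return one stable URL for all parameter variants of the same page."""
    parts = urlsplit(url)
    # parse_qsl already decodes once; fully_decode handles double encoding.
    pairs = [(fully_decode(k), fully_decode(v)) for k, v in parse_qsl(parts.query)]
    kept = sorted((k, v) for k, v in pairs if k in CONTENT_PARAMS)
    return urlunsplit(
        (parts.scheme, parts.netloc.lower(), parts.path, urlencode(kept), "")
    )


def canonical_link_tag(url):
    """Build the <link rel="canonical"> element for the page's <head>."""
    return '<link rel="canonical" href="%s">' % escape(canonical_url(url), quote=True)


variant_a = "http://WWW.Example.com/itm/widget?item=123&cat=home%2520garden&sid=xyz"
variant_b = "http://www.example.com/itm/widget?sid=abc&cat=home%20garden&item=123"

print(canonical_url(variant_a))
print(canonical_url(variant_a) == canonical_url(variant_b))
print(canonical_link_tag(variant_a))
```

Using an allow-list of content parameters, rather than a block-list of tracking parameters, means any new tracking parameter is ignored by default instead of creating a fresh duplicate URL.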

When a lot of your content ends up being user generated, how do you encourage your users to optimize it to help bring in more search exposure?

Our community of sellers is extremely smart in getting more traffic to their own items. Some of them are getting really creative, and have become good Internet Marketers themselves, without even knowing it.

If you are a seller at eBay, and you would like to become successful, you would do activities that resemble the activities of most SEOs: keyword research, title/headline construction, quality content in the item description, good pictures for the window shopper, and maybe even some social media on and off eBay.

However, their success stands or falls with the tools that eBay provides the sellers. For years we have had special tools for the sellers that have an eBay store: custom categories, larger images, store descriptions at the top of the page, and a custom page title optimization tool. We have a number of help pages describing these functions. This reminds me I have to start a project to update these!

Furthermore, eBay has a top sellers outreach team. A former colleague of mine from the International Marketing team is now working on that team. She reaches out to me pro-actively to get top ranking factors or tips into their customer outreach scripts.

Next year, we will conduct a dedicated SEO best practice sharing session with the team in Salt Lake City to educate them on SEO. While we are there, we probably will be spending some time with our Customer service representatives to understand how they can help the community of sellers become more successful by integrating SEO into their listings.

If a seller is looking to maximize the exposure of their eBay auction listings or stores do you ever recommend them driving traffic with paid search or building links into their pages from other sites? If so, what are some of the techniques you have seen sellers find most effective for increasing the exposure of their eBay listings?

To my knowledge, we have not actively promoted buying paid search ads to our sellers, mostly because of the double serving restrictions from the search engines. We have been fairly successful in driving paid search through our PS platform, where we have included stores as landing pages as well.

I have seen sellers becoming very successful in promoting their items through personal blogs. They even make money on the traffic using the eBay Partner Network.

This year we also re-launched the keyword buying program on eBay. Sellers can get more traffic to their listings using Adcommerce. In Adcommerce the seller can bid on keywords to have an ad appear on the search result page and drive more traffic to their listings or eBay store.

A lot of your content ends up cradle to grave quickly...where there are millions of new listings and millions of expired listings going through the system. What are some of the keys to helping search engines understand the structure and importance of content in such a fast changing environment?

As the most important content is hidden deep in the site and, like you said, expires quickly, discovery has become one of our primary focus points.

We have invested a good amount of resources in our data-feed technology & analytics. The Sitemaps protocol plays an important role here. As eBay has so many new listings every day, over the course of the day, you can almost update the sitemaps on a continuous basis.

However, the effort to source the items from the database, generate the sitemap files, and handle submission and pick up takes a decent amount of time. We have made good headway tuning our feeds to get more efficiency out of the items we send. We started optimizing based on probability of conversion. We can make these assumptions based on predictive modeling and data. Predictive modeling on large datasets will become even more important when it comes to scaling the projects. As a company, we are putting a lot of effort into building out a competitive advantage based on analytics, predictive modeling and scaling the technology to handle even larger datasets.
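
To make the idea of prioritizing a feed by probability of conversion concrete, here is a minimal sketch, not a description of eBay's actual pipeline. It assumes each listing already carries a score from some predictive model (the `conversion_probability` field is hypothetical), ranks listings by that score, and writes the top ones into a single sitemap file. The namespace and the 50,000 URL per file limit come from the Sitemaps protocol; everything else is illustrative.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 50_000  # per-file limit in the Sitemaps protocol


@dataclass
class Listing:
    url: str
    last_modified: str             # W3C date format, e.g. "2009-12-15"
    conversion_probability: float  # hypothetical model score between 0 and 1


def build_sitemap(listings, limit=MAX_URLS_PER_SITEMAP):
    """Return sitemap XML containing the listings most likely to convert."""
    ranked = sorted(
        listings, key=lambda item: item.conversion_probability, reverse=True
    )
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for listing in ranked[:limit]:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = listing.url
        ET.SubElement(url, "lastmod").text = listing.last_modified
    return ET.tostring(urlset, encoding="unicode")


# Made-up listings for illustration only.
listings = [
    Listing("http://example.com/itm/1", "2009-12-01", 0.10),
    Listing("http://example.com/itm/2", "2009-12-02", 0.90),
    Listing("http://example.com/itm/3", "2009-12-03", 0.50),
]

print(build_sitemap(listings, limit=2))
```

With a fixed budget of URLs per file, ranking first and truncating second is what makes the feed "more efficient": the slots go to the listings the model expects to matter most.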

Next to the Sitemap submission, eBay makes sure certain trends and categories are being communicated through PR efforts. For 2010, you can keep an eye on what’s hot in Pop-culture and fashion on eBay by keeping track of The Inside Source. Here you can find stories behind the data on eBay.

Has the verticalization of search created more opportunity or less? Do you guys devote a lot of resources toward vertical search databases?

Up until now, we have only focused on the shopping verticals, such as the shopping comparison sites. Here we have invested in specific feeds where we push items to their sites based on our optimization algorithm.

Our classifieds sites, which are more locally organized, have done more on the local optimization. They also play in the housing and job markets, which makes it more relevant to optimize for the local or vertical search players.

People sell some of the most remarkable items on eBay, and sometimes items can generate quite a bit of buzz before the listing ends. When listings end for buzz-worthy and well linked to items is there any way to capture that built up equity?

Currently, we distinguish between 3 types of View Item Pages: Open, Closed, and Expired.

Open, means the item is still for sale, which can be between 1-30 days, depending on the sales format. We also have a format for store listings, which has a duration of good till cancelled.

Closed, means the item has just been closed, but will be available longer for review. The content lives in the database, and the page is still available on the same URL as before. We actually see that our community finds these pages very helpful in their purchasing process to look up historical prices.

Expired, means the item is no longer available for review. The URL will give a 404 error, displaying a message that the item has ended or has been removed.
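
The three page states described above can be sketched as a small state function. This is only an illustration: the length of the review window is not given in the interview, so `review_window` is a hypothetical parameter, and returning 200 for Open and Closed pages is an assumption based on those pages still being available at the same URL. Good-till-cancelled store listings, which have no fixed end time, are not modeled.

```python
from datetime import datetime, timedelta
from enum import Enum


class ItemState(Enum):
    OPEN = "open"        # still for sale
    CLOSED = "closed"    # ended, but still viewable on the same URL
    EXPIRED = "expired"  # no longer available for review


def classify(end_time, now, review_window):
    """Derive the page state from the listing end time.

    review_window is a hypothetical retention period for closed items.
    """
    if now < end_time:
        return ItemState.OPEN
    if now < end_time + review_window:
        return ItemState.CLOSED
    return ItemState.EXPIRED


def http_status(state):
    """Expired pages return 404; Open and Closed pages are assumed to return 200."""
    return 404 if state is ItemState.EXPIRED else 200


end = datetime(2009, 12, 1)
window = timedelta(days=90)  # assumed value, for illustration only

for now in (datetime(2009, 11, 30), datetime(2009, 12, 15), datetime(2010, 6, 1)):
    state = classify(end, now, window)
    print(now.date(), state.value, http_status(state))
```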

There have been some attempts to capture the link equity from the buzz-worthy eBay items in the past. A couple of years ago, a project was launched called “Best of eBay”. This was essentially a Digg-like site, where the community could vote for the best and weirdest items. Unfortunately, the site was not designed with the eBay community in mind, and was poorly marketed. It failed to live up to expectations, and the project died.

You are right that there might be a good way of capturing more of the incoming link equity on the rare and buzz-worthy items. I recently even bought a book on eBay, which listed all the rare and viral items over the years. Thinking about all the links that went to the Virgin Mary Grilled Cheese Sandwich makes me excited. Maybe not a lot of people will be searching every day for a sandwich that displays the Virgin Mary, but at least you can sell a lot of toasters around it!

I sometimes browse around the strange items that are for sale in search for link bait ideas. The strange eBay items are a perfect fit for pure white hat link bait. Just check out this Elvis Personally owned/worn Lion Claw Necklace that sold for almost $30K, or the auction of the popular PVRblog.com site, starting at $0.99, going for more than $12K.

For 2010, I might start a new pet project that will tap into the wealth of strange and funny items getting PR attention around the globe. IMHO as long as the project drives value for our customers, it will be successful in the search engines too. And will be a lot of fun to play around with.

You guys have more data than many search engines do. How do you leverage it to help define your SEO strategy?

I really love the eBay data! I have made it my mission, and a pet project, to do more with this data in the future for eBay and the seller’s community.

The eBay site is not only a marketplace, where buyers and sellers can find each other for common or rare products; eBay is also very much a search engine which reflects shopping intent. This shopping search volume is accompanied with conversion data. Based on keywords, or product searches, we track what sells and what does not get sold.

Our paid search colleagues are world class in building predictive models for the conversion rate per keyword. For over 5 years, the paid search teams have squeezed more efficiency out of the paid search budgets to get more for the same investment.

On top of this predictive modeling, the technology team has built our own paid search platform, which makes it easy to scale large amounts of keywords, optimizing for the highest ROI, across multiple countries and platforms.

If you have large amounts of data, it will become more important to invest as a company in analytical and technical resources. You need the analytics to understand what the data can tell you, on which you can form actionable projects to drive more efficiency. You need technology investments to build the platforms to execute against the learnings from the data.

One good example of this was the outbreak of the Zhu Zhu Pets as THE toy for the Xmas shopping season this year. A large number of online data providers have reported on the popularity of the little mechanical hamster right after Black Friday/Cyber Monday. I spotted an increase in search volume on the eBay site back in September, while digging through some internal eBay search data.

Thinking about your career path and how many things worked well for you, what were some of the keys to so many things falling into place for you? If a person wants to become an enterprise level SEO, what are the key things they should focus on learning & doing?

In 2005 I read the book “Who Moved My Cheese?”. This changed my life in so many ways, as it changed my attitude towards change. Change is all around us. The way you react to changes around you can impact your success in a big way. One particular rule from the book that made me change myself and the path of my life is: “What would you do if you weren’t afraid?”

I thought that was a wise lesson, and it got me to the point in my life where I currently am. I had the opportunity to move to the US for a job that I wanted. If I had acted out of my fears, I probably would not have done it. But facing the fears, and what these really were, it became really clear to me that I could always return to The Netherlands without losing too much.

If you want to become an enterprise level SEO, you should do three things:

  1. Read the book “Never Eat Alone” and start learning how to build connections and relationships asap!
  2. Learn from the tech teams how scaling large websites works, and about the problems which can arise from changing the infrastructure
  3. Keep learning more SEO on a daily basis.

---

Thanks a bunch Dennis!

To keep up with Dennis check out The Next Corner (or its Dutch counterpart). And if you use Twitter his handle is TheNextCorner.
