Search Technologies
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
 
 
User Name:
Password:
Remember me
Go Back   SEO Chat ForumsSearch Engine StrategiesSearch Technologies

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread SEO Chat Forums Sponsor:
  #1  
Old April 5th, 2005, 01:59 PM
randfish's Avatar
randfish randfish is offline
SEO Chat Intermediate (1500 - 1999 posts)
 
Join Date: Jul 2004
Location: Seattle, WA
Posts: 1,874 randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 6 Days 12 h 54 m 32 sec
Reputation Power: 11
Lightbulb Keyword Density - A Bogus Metric

<removed>
Comments on this post
GeoffreyF67 agrees: Are you a professor in real life?
AltherrWeb agrees: Here's your rep - 6 months later

Last edited by randfish : November 18th, 2005 at 03:11 PM.

Reply With Quote
  #2  
Old April 5th, 2005, 04:20 PM
Robert Paulson Robert Paulson is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Feb 2005
Posts: 528 Robert Paulson User rank is Corporal (100 - 500 Reputation Level)Robert Paulson User rank is Corporal (100 - 500 Reputation Level)Robert Paulson User rank is Corporal (100 - 500 Reputation Level)Robert Paulson User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 1 Week 13 h 2 m 2 sec
Reputation Power: 5
Let's lower the rep-spreading limits so I can give Ranfish more rep!


I'm glad to hear this, because I'm working on a new site, and I'm having a really hard talking about the thing without saying the thing too many times.

Hard to talk about the benefits of red widgets without saying "red widgets" about a dozen times. Thankfully, the sites I'll be competing with mention red widgets quite a bit more than that.

Reply With Quote
  #3  
Old April 5th, 2005, 07:25 PM
xan's Avatar
xan xan is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Oct 2004
Location: UK
Posts: 108 xan User rank is Lance Corporal (50 - 100 Reputation Level)xan User rank is Lance Corporal (50 - 100 Reputation Level)xan User rank is Lance Corporal (50 - 100 Reputation Level) 
Time spent in forums: 20 h 9 m 25 sec
Reputation Power: 5
hey-ho,

I have no idea about the answers to the question of it being bogus.

It definately isn't in Knowledge Discovery in Databases (KDD), digital libraries, bibliographical libraries, and almost every IR system, its valuable knowledge when all you have is some text, its actually a good place to start.

This information allows us to quickly assess regions within the documents that are “hotspots” insofar as they exhibit high query-keyword density or occurrences of a certain word or word combinations (Burstiness). Its quite important for automatic news for example, to know what topic is hot.

There are many different ways of getting those figures:
Hanning windows are one way,and used to weigh more strongly the data values at the center of the finite range, and applies less weight to the values at the extremes of the range. You can use Apply Bayes’ Rule and with non-parametric density estimation. The distribution of the number of keyword vectors in a Web page can tell you that too.

What we're looking for is the frequency distribution patterns.

If you like you can use the ratio between keyword number and total word number in a page and domain for a quick and simple estimation, as is commonly used in SEO.

I know its invaluable in my work. But obviously not used on its own.

Reply With Quote
  #4  
Old April 5th, 2005, 07:46 PM
randfish's Avatar
randfish randfish is offline
SEO Chat Intermediate (1500 - 1999 posts)
 
Join Date: Jul 2004
Location: Seattle, WA
Posts: 1,874 randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 6 Days 12 h 54 m 32 sec
Reputation Power: 11
Xan -

You're saying it's used to identify relevant documents, but not neccessarily for rankings - is that right? It makes good sense - a document with lots of mentions of a term is probably at least on the subject!

Reply With Quote
  #5  
Old April 5th, 2005, 07:47 PM
EGOL's Avatar
EGOL EGOL is offline
EGOL
Click here for more information.
 
Join Date: Jun 2003
Location: Snow belt.
Posts: 6,796 EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level)EGOL User rank is Second Lieutenant (5000 - 10000 Reputation Level) 
Time spent in forums: 1 Month 2 Weeks 5 Days 2 h 7 m 37 sec
Reputation Power: 75
Thanks Randfish... great post. I have never paid attention to KW density - but more by luck, wanting to write naturally and being lazy. Also, I get LOTS of traffic from words that appear just one time on my pages.
__________________
* Its not the size of the dog in the fight that matters... it's the size of the fight in the dog.
* Free advice generally isn't worth much, but cheap advice is worth even less.

Reply With Quote
  #6  
Old April 5th, 2005, 08:07 PM
xan's Avatar
xan xan is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Oct 2004
Location: UK
Posts: 108 xan User rank is Lance Corporal (50 - 100 Reputation Level)xan User rank is Lance Corporal (50 - 100 Reputation Level)xan User rank is Lance Corporal (50 - 100 Reputation Level) 
Time spent in forums: 20 h 9 m 25 sec
Reputation Power: 5
Quote:
Originally Posted by randfish
Xan -

You're saying it's used to identify relevant documents, but not neccessarily for rankings - is that right? It makes good sense - a document with lots of mentions of a term is probably at least on the subject!


That's right. It's not used to rank anything, why should it be, of course many other factors have to be considered and then those all need to be weighted as well, before these measures are even taken into account in the overall calculations in the wider document space! And it does go on, but I'll stop there.

So...it is used for ranking, ... but in the same way that an exhaust pipe contributes to a car working.

Reply With Quote
  #7  
Old April 5th, 2005, 08:19 PM
randfish's Avatar
randfish randfish is offline
SEO Chat Intermediate (1500 - 1999 posts)
 
Join Date: Jul 2004
Location: Seattle, WA
Posts: 1,874 randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 6 Days 12 h 54 m 32 sec
Reputation Power: 11
Quote:
Originally Posted by xan
So...it is used for ranking, ... but in the same way that an exhaust pipe contributes to a car working.

That's a beautiful analogy Xan. So appropriate.
Comments on this post
GadgetsGuy agrees: Another great analogy us Bimmer owners have come to appreciate, is that Bimmers do not come equipped
with good cup holders, for the exact same reason that coffeetables do not come equipped with
steering wheels! lol

Reply With Quote
  #8  
Old April 6th, 2005, 10:39 AM
Phoenix's Avatar
Phoenix Phoenix is offline
Contributing User
SEO Chat Beginner (1000 - 1499 posts)
 
Join Date: Jan 2003
Location: Texas!
Posts: 1,135 Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 3 Days 9 h 4 m 3 sec
Reputation Power: 8
Send a message via AIM to Phoenix
Quote:
erode the myth of kw density


Rand, the myth is eroded already my friend. No more KW densities. This is old stuff. When I started 4 years ago, we worked on keyword densities religiously. I had percentages memorized for the various engines, we were comparing the top results, kicking butt and taking names so to say with the rankings. Then I figured that you could accomplish the same thing with just writing regular paragraphs and bam! I never calculated a keyword density or ran one of the many tools we use to do so.

Quote:
and maybe get DevShed to build a term weight measurement tool


What for? Would be as useful as a screendoor on a submarine. (Jasonnytc said that one day, and loved it) :-)

Reply With Quote
  #9  
Old April 6th, 2005, 11:22 AM
pk_synths's Avatar
pk_synths pk_synths is offline
Contributing User
SEO Chat Specialist (4000 - 4499 posts)
 
Join Date: May 2003
Location: Chicago
Posts: 4,109 pk_synths User rank is Corporal (100 - 500 Reputation Level)pk_synths User rank is Corporal (100 - 500 Reputation Level)pk_synths User rank is Corporal (100 - 500 Reputation Level)pk_synths User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 5 Days 17 h 45 m 3 sec
Reputation Power: 12
Send a message via AIM to pk_synths Send a message via Yahoo to pk_synths
Great post guys. I've also noticed that KW density isnt playing a very big role in ranking anymore but I think it is still used as a last resort if Google stumbles upon a site with many pages about the same topic/weight and doesnt know which one to display in the SERPs. Google usually picks the page with an average (natural) density over low or high density pages of the same topic but the site ranks the page in the SERP and not the density itself. I've seen it happen to my pages quite a few times.
__________________
-PK

Litigation Support

Reply With Quote
  #10  
Old August 31st, 2005, 07:19 PM
AZdevil AZdevil is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Aug 2005
Location: Scottsdale, AZ
Posts: 37 AZdevil User rank is Corporal (100 - 500 Reputation Level)AZdevil User rank is Corporal (100 - 500 Reputation Level)AZdevil User rank is Corporal (100 - 500 Reputation Level)AZdevil User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 9 h 32 m 47 sec
Reputation Power: 6
I was under the impression, kwd is somewhat valuable and can serve as a tool to avoid keyword stuffing and getting sandboxed. Even though it isn't exactly the first order of offense in SEO, I still like to keep my eye on it to avoid keyword stuffing. I read somewhere that kwd for your keywords should stay around 9-10%, anything over that and you may get sandboxed. Does that sound right?

Reply With Quote
  #11  
Old August 31st, 2005, 07:51 PM
randfish's Avatar
randfish randfish is offline
SEO Chat Intermediate (1500 - 1999 posts)
 
Join Date: Jul 2004
Location: Seattle, WA
Posts: 1,874 randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level)randfish User rank is Sergeant (500 - 2000 Reputation Level) 
Time spent in forums: 6 Days 12 h 54 m 32 sec
Reputation Power: 11
KWD can be measured as part of spam, but it's really relative to document size, so a specific % can't be given. If you normalize document size, however, I would surmise that KWD over 5% is already risky...

Reply With Quote
  #12  
Old September 16th, 2005, 12:32 AM
mickh's Avatar
mickh mickh is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Sep 2005
Posts: 143 mickh User rank is Private First Class (20 - 50 Reputation Level)mickh User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 1 Day 14 h 36 m 23 sec
Reputation Power: 4
So the practical application of this insight is:

1. Don't worry about keywords at all?

2. Only add keywords until they're more frequent than any other words, then stop (possibly because keeping going will trigger some KW stuffing penalty)?

Reply With Quote
  #13  
Old September 25th, 2005, 03:29 AM
PRNewsNow's Avatar
PRNewsNow PRNewsNow is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Jul 2005
Location: New Mexico
Posts: 152 PRNewsNow User rank is Private First Class (20 - 50 Reputation Level)PRNewsNow User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 23 h 58 m 31 sec
Reputation Power: 4
thanks I think that last one dumbed it down to the right level for me.

Reply With Quote
  #14  
Old September 26th, 2005, 09:24 PM
mickh's Avatar
mickh mickh is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Sep 2005
Posts: 143 mickh User rank is Private First Class (20 - 50 Reputation Level)mickh User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 1 Day 14 h 36 m 23 sec
Reputation Power: 4
Er, well, that last one was actually a question. I was trying to dumb it down for myself and make sure I got it right. May not be though!

Reply With Quote
  #15  
Old September 26th, 2005, 10:59 PM
PRNewsNow's Avatar
PRNewsNow PRNewsNow is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Jul 2005
Location: New Mexico
Posts: 152 PRNewsNow User rank is Private First Class (20 - 50 Reputation Level)PRNewsNow User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 23 h 58 m 31 sec
Reputation Power: 4
Ok.... Guess we will both wait for an answer. Sounded right to me in that perspective though.

Reply With Quote