Why You Really Need to Know: How Data Mining Works and How It Is Affecting You

September 10, 2008 · 5 comments

You need to clearly understand how data mining works because it is already affecting your life and will do so much more in the very near future. It is simple to understand and if you’re reading this there is a 90+% chance you use it yourself every day. Any person who wants to know all about you can do a similar search across multitudes of supposedly private databases.

A search engine (Google, Yahoo, Ask – even my favorite Zuula) are all data mining services. You type in what you want to find and the search engine provides the information you’ve requested. It isn’t quite that simple as the search engines filter the data they provide. Filtering is used to eliminate some results and decide in what order the information is presented.

Since most people only view the first page or two, the search engine has great power to influence what information is widely viewed and what is likely to be overlooked. Some are taking this further and actually censoring what appears to be available online. For now we’ll focus on data mining though.

So why should you care about data mining? Try this example. Type in the name of any famous person in your favorite search engine or tool bar. Notice all the information that you would consider private that is available to anyone online. That is just what is supposedly publicly available. What do you suppose is in private databases on that famous person – and on you? What if someone could search all of these at once:

  • All three credit reports
  • Check reporting companies
  • Medical records
  • Employment records

Being able to search by name once helped reduce what searches could find on you. This is because many people sometimes use their name in many different forms:

  • First Last
  • First Middle Last
  • First Middle Initial Last

Add to that various ways women have used for maiden, married, and then hyphenated forms and you would think that would make data more difficult to accumulate.

The power of data mining combined with identifying numbers – especially social security numbers – has largely eliminated that issue. Prove it yourself quickly by doing a quick search on any number of public databases. Here is one example:

  • Search for Jane Smith showing cities lived in, potential relatives, their age – and that is only what they’ll give you for free. Try searching for your own name or those of others you know. Imagine what someone evil might do with that information that is now free to anyone who knows how to find it?

That is not the only database online. Check out WhitePages People Search which may offer your job title, employer, members of your household, and your age to anyone who knows your name. They also offer to sell more of your personal information to those willing to pay the minimal fees they charge.

Did you notice the WhitePages Neighbor Search function? Type in any address and the database will give you the names and addresses of your neighbors. While this could be a good way to find contact information in an emergency, it could be used for evil. Those who are not aware how much information is now available to strangers could easily be targeted by con artists. If someone knows your neighbor’s names and their kids names what else might you unknowingly reveal to them because you assume this person is a friend or relative?

The fastest way to pull information about you across all sources is by using any information that is common between them. The most dangerous of these is something that never changes and the one and only piece of personal information that almost everyone has for life is their Social Security Number (SSN).

When I was young I clearly remember promises that our social security numbers (SSN) were ONLY for tax purposes and would never be used for identification! It even said that on the bottom of the little paper SSN card: FOR SOCIAL SECURITY AND TAX PURPOSES — NOT FOR IDENTIFICATION. Even the SSA’s own Web site confirms that we were assured Social Security Numbers would not be used as a National ID.

Any databases that contain social security numbers can pull all of your personal information together in minutes or faster! This is why it is a very good idea to stop giving your SSN number out to everyone who asks. Do NOT print it on your checks. Many businesses including electric companies and other utilities will insist it is mandatory when it is not – yet. Keep refusing or offer to use only the last four digits and escalate to management.

Why should you care? Where are we going? Who can speak out if even Christianity is being censored? Are you sure you can handle the truth?

LEARN TO PROTECT (as much as possible) YOUR PERSONAL PRIVACY AND FUTURE:

[WARNING: One of the easiest ways to track someone is to offer them a way to opt out of tracking! The number of groups that appear to protect us actually hurt us. Telling the difference is extremely difficult. Don't be in any hurry to use these types of sites.]

ADDITIONAL RESOURCES on PRIVACY CONCERNS:

ADDITIONAL RESOURCES on DATA MINING:

PERSONAL INFORMATION IN DATABASES COMPROMISED:

USES of SOCIAL SECURITY NUMBERS:

ADDITIONAL REFERENCE RESOURCES ON SOCIAL SECURITY NUMBERS

Stay In The Loop!

Subscribe to the GrowMap feed via RSS or Email to receive notifications when new posts are published. Follow GrowMap on Twitter too!

{ 5 comments… read them below or add one }

jack parler March 11, 2009 at 2:34 am

Very nice information. Thanks for this.

Reply

InternetStrategist from GrowMap.com March 20, 2009 at 1:17 am

Twitter: @GrowMap

In another blog someone asked whether Google could track users across Web sites. Here is some detail on that:

DoubleClick gets in trouble for wanting to merge user identification with behavior. There was publicity and legal action:

http://www.cdt.org/privacy/000302doubleclick.shtml

http://www.out-law.com/page-2486

Later, Google acquired DoubleClick amid protests by privacy groups. They finally overcame the last obstacle and got EU clearance for the merger:

http://www.marketwatch.com/news/story/google-lands-eu-clearance-doubleclick/story.aspx?guid={041C7A8C-FF82-4654-A874-62EF04D53463}

The MarketWatch article says:

“DoubleClick is expected to provide Google with a trove of existing customers, online inventory and technology needed to better track Web users across sites in order to deliver them targeted display advertisements.”

and

“Microsoft complained publicly last year that Google’s purchase of DoubleClick would result in “sole control over the largest database of user information the world has ever known.”

Do you suppose that they won’t use the capability provided by their acquisition? Want more details? See these:

http://www.pcworld.com/article/131007/ftc_asked_to_block_googledoubleclick_merger.html

http://www.euractiv.com/en/competition/consumer-groups-alarmed-google-online-ad-merger/article-165346

http://epic.org/privacy/ftc/google/

InternetStrategist’s last blog post..Making Money with Your Blog – Part 2 – Evaluating Affiliate Programs

Reply

InternetStrategist from GrowMap.com March 20, 2009 at 1:19 am

Twitter: @GrowMap

@jack parler I hope you saw the other posts in the privacy section and especially the comments in the post When FREE Is Definitely Not: What Those Free Services Are Really Costing You which I will link from this comment.

You can access all posts related to privacy and data mining under categories in the right sidebar.

InternetStrategist’s last blog post..REVIEW: Mastering Google Analytics – Easy to Understand FREE eCourse

Reply

Shaun from personal checks June 6, 2009 at 3:21 am

That’s quite a nice informative post. But I dont understand much about data mining. Can you give me more info about it?

Shaun’s last blog post..Spotted a Trend

Reply

Internet Strategist from GrowMap June 26, 2009 at 12:16 pm

Twitter: @GrowMap

Hello Shaun,

Data mining is a simple concept although it can extremely complex in implementation. It is any method used to extract specific information from a database.

The largest danger for individuals is the enormous amount of personal information being compiled by searching across multiple databases. Few realize that their information – even medical records – are widely available.

When services collate information from many sources and then make it available online to the general public, for a fee, or to those with the money there can be serious consequences. For example, I find it unsettling that some white pages sites now list the names and ages of children.
Internet Strategist @GrowMap´s last blog ..Select Keywords First to Make Your Content Easy to Find My ComLuv Profile

Reply

Leave a Comment

CommentLuv Enabled

This site uses KeywordLuv. Enter YourName@YourKeywords in the Name field to take advantage.

*
To prove you're a person (not a spam script), type the answer to the math equation shown in the picture. Click on the picture to hear an audio file of the equation.
Click to hear an audio file of the anti-spam equation

Previous post: Slip of the Tongue or Truth Revealed: You Have Zero Privacy Anyway – Get Over It

Next post: When FREE Is Definitely Not: What Those Free Services Are Really Costing You