Valentine's tech log: [PostgreSQL] Index sizes depending on the type of the field being indexed

http://tech.valgog.com/2011/04/index-sizes-depending-on-type-of-field.html empirical data about difference in index sizes depending on the data type

Schneier on Security: A Revised Taxonomy of Social Networking Data

http://www.schneier.com/blog/archives/2010/08/a_taxonomy_of_s_1.html social network user data categorization by Bruce Schneier (useful)

Top 1000 sites[on the internet] - [from google's] DoubleClick Ad Planner [statitics]

http://www.google.com/adplanner/static/top1000/ Top 1000 by google

[RUS]Проектирование френдленты и рейтингов / Проектирование БД : Форум на SQL.RU | [Forum SQL.ru] Modeling frendlist, ratings

http://sql.ru/forum/actualthread.aspx?tid=754989 Some comments on modeling friendlits user activity, for social media websites (or apps)

Official Google Research Blog: All Our N-gram are Belong to You

http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html google word frequency dataset, n-gram models. Donwload or DVDs are available from U of Penn. ~24GB.

Analysis from the Bottom Up | The Data Model That Nearly Killed Me

http://www.syleum.com/2009/03/17/healthcare-data-model/ tells about problems with IT in HealthCare and Obama’s electronic records plan from one person’s 1fst hand experience. Make user u read the comments. (allow 30-40 minutes )

Pitrtools - Trac /set of tools for setting up PITR on PostgreSQL/

https://projects.commandprompt.com/public/pitrtools/ PITRTools is a set of wrapper scripts that provide warm standby functionality to PostgreSQL. The software is essentially two scripts, cmd_archiver.py and cmd_standby.py. The project is under the BSD license.

Why power failures are bad for your data

http://www.halfgaar.net/why-power-failures-are-bad-for-your-data artilcle tells what happens when u pull the plug … in short nothing good will happen, so use UPS! read article for details

kelemen-2007-C5-Silent_Corruptions.pdf (application/pdf Object)

http://fuji.web.cern.ch/fuji/talk/2007/kelemen-2007-C5-Silent_Corruptions.pdf Interesting paper about data corruption from CERN. Basically they did verify that when you write a file to a disk, it not always come back the way it was written. …. so …. you have been warned ;-)

Library of Free Data Models from DatabaseAnswers.org

http://www.databaseanswers.org/data_models/index.htm useful data model samples