Valentine's tech log: [PostgreSQL] Index sizes depending on the type of the field being indexed
http://tech.valgog.com/2011/04/index-sizes-depending-on-type-of-field.html empirical data on how index sizes differ depending on the data type of the indexed field
http://www.schneier.com/blog/archives/2010/08/a_taxonomy_of_s_1.html social network user data categorization by Bruce Schneier (useful)
http://www.google.com/adplanner/static/top1000/ Top 1000 by Google
http://sql.ru/forum/actualthread.aspx?tid=754989 Some comments on modeling friend lists and user activity for social media websites (or apps)
http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html Google word frequency dataset, n-gram models. Download or DVDs are available from U of Penn. ~24GB.
http://www.syleum.com/2009/03/17/healthcare-data-model/ describes problems with IT in healthcare and Obama’s electronic records plan from one person’s first-hand experience. Make sure you read the comments. (allow 30-40 minutes)
https://projects.commandprompt.com/public/pitrtools/ PITRTools is a set of wrapper scripts that provide warm standby functionality to PostgreSQL. The software is essentially two scripts, cmd_archiver.py and cmd_standby.py. The project is under the BSD license.
http://www.halfgaar.net/why-power-failures-are-bad-for-your-data article explains what happens when you pull the plug … in short, nothing good will happen, so use a UPS! Read the article for details.
http://fuji.web.cern.ch/fuji/talk/2007/kelemen-2007-C5-Silent_Corruptions.pdf Interesting paper about data corruption from CERN. Basically, they verified that when you write a file to a disk, it does not always come back the way it was written … so you have been warned ;-)
http://www.databaseanswers.org/data_models/index.htm useful data model samples