Supermind Search Consulting Blog 
Solr - Elasticsearch - Big Data

A Tutorial on Dynamic Programming

Posted by Kelvin on 08 Jan 2007 | Tagged as: blogmark

http://mat.gsia.cmu.edu/classes/dynamic/dynamic.html (Via )

Sequence Comparison: Some Theory and Some Practice

Posted by Kelvin on 08 Jan 2007 | Tagged as: blogmark

http://www.ime.usp.br/~is/papir/sctp/ (Via Imre Simon :: home page)

String Similarity Metrics for Information Integration

Posted by Kelvin on 08 Jan 2007 | Tagged as: blogmark

http://www.dcs.shef.ac.uk/~sam/stringmetrics.html

JFK Airport – How to Get To JFK Airport from New York City

Posted by Kelvin on 03 Jan 2007 | Tagged as: blogmark

http://www.nysubway.com/airport/jfk.html

Fuzzy string matching

Posted by Kelvin on 03 Jan 2007 | Tagged as: programming

I've been recently peripherally involved in a project which attempts to perform a fuzzy match on names in a MySQL database. With "Homethinking":http://www.homethinking.com, we had to do something similar regarding matching for realtor and brokerage names. Its also related to some of the Lucene consulting I've been involved with. Its an interesting problem. There's an […]

Bitap algorithm – Wikipedia, the free encyclopedia

Posted by Kelvin on 03 Jan 2007 | Tagged as: blogmark

http://en.wikipedia.org/wiki/Bitap_algorithm Interesting fuzzy string match algorithm using bitwise operations

Exploring Hadoop SequenceFile

Posted by Kelvin on 03 Jan 2007 | Tagged as: Lucene / Solr / Elasticsearch / Nutch

Hadoop's SequenceFile is at the heart of the Hadoop io package. Both MapFile (disk-backed Map) and ArrayFile (disk-backed Array) are built on top of SequenceFile. So what exactly is SequenceFile? Its class javadoc tells us: Support for flat files of binary key/value pairs.– not very helpful. Let's dig through the code and find out more: […]

MySQL Falcon open-sourced

Posted by Kelvin on 02 Jan 2007 | Tagged as: programming

Just read that "MySQL Falcon storage engine":http://www.mysql.org/doc/refman/5.1/en/se-falcon.html has been "open-sourced":http://it.slashdot.org/article.pl?sid=07/01/02/209227. http://mike.kruckenberg.com/archives/2006/04/jim_starkey_int.html has a really good, concise brief on Falcon and what it does.

chris glass: How to catch a mouse without a mousetrap

Posted by Kelvin on 02 Jan 2007 | Tagged as: blogmark

http://glass.typepad.com/journal/2005/09/how_to_catch_a_.html What a goooooood idea..

Ehud on forces driving commercial programming language creation

Posted by Kelvin on 01 Jan 2007 | Tagged as: programming

Just discovered this gem on LtU here: Ehud Lamm – Re: Growing a Language 5/25/2004; 3:02:35 PM (reads: 118, responses: 0) (in response to a previous post: I'll now turn the tables back: is it possible that the "commercial" languages (VB, Java, C#) gained popularity because they were created to provide programmers what they wanted […]

« Previous PageNext Page »