Sunday, February 28, 2010
Ramdrive and Memory FS
On Unix this is typically realized mounting a memory file system:
mount_mfs -s 20m swap /workOn Windows (XP/Vista/7), I found very useful to use this software Ramdisk from dataram, which creates a normal disk but in ram.
Saturday, February 27, 2010
Friday, February 26, 2010
Random Permutations
Thursday, February 25, 2010
Get k tags out of n tags
Wednesday, February 24, 2010
Largest span of increasing pair in an array of integers
Tuesday, February 23, 2010
Inversions
Sunday, February 21, 2010
Malloc and Free
Saturday, February 20, 2010
Find what is missing
Friday, February 19, 2010
Find all the patterns in a string
Thursday, February 18, 2010
Wednesday, February 17, 2010
Google Now Includes MySpace Status Updates in Real-Time Search Results
Tuesday, February 16, 2010
Google Buzz Kills Auto-Follow on Privacy Concerns
Perhaps Google got a little too comfortable with Internet ignorance. They certainly struck a nerve with the autofollowing and autosharing of Buzz. Still, it's doubtful this will do any serious damage to Google. The psychological relief of getting what you want from a company (in this case, greater perceived privacy) is easier than changing your email and search habits."
source: searchenginewatch
Monday, February 15, 2010
One fly and two colliding trains.
Sunday, February 14, 2010
Carl Icahn selling off Yahoo shares
"Carl Icahn has substantially cut his stake in Yahoo, according to regulatory filings made public Friday.
The billionaire investor had just under 12 million shares of Yahoo at the end of 2009, according to the new filing with the Securities and Exchange Commission. That compares with more 60 million shares he held last summer"
Saturday, February 13, 2010
Dark times at yahoo? Looking for next steve jobs
Friday, February 12, 2010
Google to buy Aardvark
Is google returning to Q&A?
Thursday, February 11, 2010
IAC writes down value of search unit by nearly 1ドル billion
"New York-based IAC (NASDAQ: IACI) wrote down 991ドル.9 million from the goodwill on IAC Search & Media, the part of its business that contains Ask.com and also the much smaller Dictionary.com. Goodwill is a company’s guess about the future earning power of an asset or company it has bought. It’s the difference between the price paid for the asset and its book value on the balance sheet. Ask.com started life in Berkeley in 1996 as Ask Jeeves -- it was an early dot-com darling. Later, the business moved to a tower in downtown Oakland and was bought by IAC in 2005 in a deal that valued it at 1ドル.85 billion."
Wednesday, February 10, 2010
New comscore out
source: business insider
Tuesday, February 9, 2010
Towards Recency Ranking in Web Search
Towards Recency Ranking in Web Search is an high quality paper from Yahoo! about relevancy ranking. The main contribution of the paper is twofold: it presents a query classifier for recency and a ranking model for recent results.
The query classifier builds two models representing the Content and the Query data at time t, respectively. The two models are then compared on different instants of time and a query is considered recent if it increases his probability of being generated in two different istants. This approach is interesting. Nevertheless there are queries that would fresh results, even if they are constantly observed (such as "Obama", "Britney Spears", "stock quotation", etc).
The ranking model aims at learning a ranking function based on four categories of recency-related features: timestamp features, linktime features, webbuzz features and page classification
features. The learning algorithm is GBrank. To solve the recency data insufficiency problem, the authors explored several modeling approaches by utilizing regular ranking data. In compositional model the normal ranking output is used as a training feature, while in over-weighting model the normal ranking output is used with recency features and an emphirical optimal weight is derived. In adaptation model, training data from normal ranking is used for learning a regression tree model, which is then fine-tuned with recency ranking data.
The evaluation set is made up of 70,131 query-url pairs collected during a period of four months (Feb.∼May, 2009) judged by humans and is based on NDGC metrics. One final result is worth mentioning. In the paper, linktime features are the most important recency features among all recency features. Quoting the authors: "Thus, recency is competing with popularity, which is usually indicated by link-based features and click-based features. This leads to the interesting topic on how to appropriately deal with the relationship between recency and popularity"
Monday, February 8, 2010
Compute all the items which appears more than p% of time
Sunday, February 7, 2010
Compute all the items which appears more than 50% of time
Saturday, February 6, 2010
Slides for LinkedIn People Search
[フレーム]
Thanks to Greg for pointing out them. Interesting work@LinkedIn based on Lucene's customizations.
Friday, February 5, 2010
Beautiful video on Twitter Creation
Twitter Code Swarm from Ben Sandofsky on Vimeo.
Bing to power Facebook Search
Thursday, February 4, 2010
Anatomy of a Large-Scale Social Search Engine
- Users can ask questions in natural language, not keywords
- Content is generated “on-demand”, tapping the huge amount of information in peoples’ heads
- The system is fueled by the goodwill of its users
- 87.7% of questions sent to Aardvark got answered (very high answer rate!)
- 75.0% of users who asked Aardvark a question also answered a question for someone else (very high participation rate!)
- 70.4% of answer feedback had a rating of ‘good’ as opposed to ‘ok’ or ‘bad’ (high quality!)
Wednesday, February 3, 2010
Aol wil use Google, once again
Who is out of this list?
Tuesday, February 2, 2010
A collection of code competition sites
- http://geeksforgeeks.org/
- http://stackoverflow.com
- http://www.topcoder.com/tc
- http://www.spoj.pl
- http://www.projecteuler.net
- http://codechef.com
- http://uva.onlinejudge.org/
Anything missing from the list?