Posts

baamboo

It seems that Baamboo, a very popular music search engine in Vietnam, uses SQL full-text search. This surprised me, since all of the people I know use Lucene as soon as they want to create a search engine.

Lucene is very good at what it does. Its indexing and storage performance is second to none. In fact, it's so fast that a lot of companies use it as a quick-and-dirty storage dumping ground for raw data, knowing that it will be much faster and more scalable than a relational database. Why not take advantage of this incredible power and take one more item off of your database's back? And that is not to mention the fact that a Lucene index query is probably a lot faster than an SQL query grabbing data from a Microsoft SQL Server full-text index.

If I were the designer of Baamboo, I'd use, yeah, you got it already, Lucene and its sub-projects to do the searching. A quick draft architecture would be a combination of Nutch and Solr, i.e. using Nutch to crawl the Intern…

please help testing my application

If you don't want to know what I'm doing, just go to http://ec2-72-44-40-221.z-2.compute-1.amazonaws.com/ and do some random searches. That's enough to help me ;). Otherwise, read on.

I've been exploring thrudb, a new document-oriented database service. I wanted to see how well thrudb performs with a large dataset, so I fed it the DMOZ catalog, which contains information about 4,600,000 websites from all over the world.

I also wrote a small Django application that accepts a keyword and queries thrudb to get the links relevant to it. You can check it out at http://ec2-72-44-40-221.z-2.compute-1.amazonaws.com/.
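To make the keyword lookup a bit more concrete, here is a minimal sketch of the two-step pattern: ask a search index for matching ids, then fetch the matching documents from a document store. It does not use the real thrudb client. `InMemoryIndex`, `InMemoryDocStore`, `search_links` and the sample records are all hypothetical stand-ins that run in memory, so treat it as an illustration of the idea and not as the application's actual code.

```python
import time
from collections import defaultdict


class InMemoryIndex:
    """Hypothetical stand-in for the search index (the role thrudex plays)."""

    def __init__(self):
        # token -> set of document ids containing that token
        self._postings = defaultdict(set)

    def add(self, doc_id, text):
        for token in text.lower().split():
            self._postings[token].add(doc_id)

    def search(self, keyword):
        # .get() avoids creating empty entries for unknown keywords
        return sorted(self._postings.get(keyword.lower(), set()))


class InMemoryDocStore:
    """Hypothetical stand-in for the document store (the role thrudoc plays)."""

    def __init__(self):
        self._docs = {}

    def put(self, doc_id, doc):
        self._docs[doc_id] = doc

    def get_many(self, doc_ids):
        return [self._docs[i] for i in doc_ids if i in self._docs]


def search_links(index, store, keyword):
    """Look up matching ids, fetch the documents, and time both steps."""
    t0 = time.perf_counter()
    ids = index.search(keyword)
    t1 = time.perf_counter()
    docs = store.get_many(ids)
    t2 = time.perf_counter()
    return {"results": docs, "index_seconds": t1 - t0, "store_seconds": t2 - t1}


# Made-up sample records, not real DMOZ entries.
index = InMemoryIndex()
store = InMemoryDocStore()
sample = {
    "1": {"url": "http://example.org/", "title": "Example music archive"},
    "2": {"url": "http://example.com/", "title": "Example news site"},
}
for doc_id, doc in sample.items():
    store.put(doc_id, doc)
    index.add(doc_id, doc["title"])

out = search_links(index, store, "music")
print([d["url"] for d in out["results"]])
```

Reporting the two timings separately makes it easy to see which of the two steps dominates a query.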

As you use the application, you may notice that the time thrudoc takes for each query is much larger than the time thrudex takes. This is because, for the sake of simplicity, I use the disk backend for thrudoc and, as both Ross and Jake said, the disk backend is not suitable for a large dataset. I'm going to load the same dataset into other backends such as mysql o…

Scary attacks

I'd never want to be targeted by people who are well funded, highly skilled and motivated like this:
Groups supporting freedom of Tibet have been attacked with highly targeted and technically advanced attacks.

Quoting an Asia Free Press news report: "AFP received an email Tuesday from someone claiming to be in Denmark, who had attached a file they said were pictures of Tibetans shot by the Chinese army. When AFP tried to open the attachment, a virus warning appeared."

So...what do these attacks look like in practice? Let's take an example. Here's an email that was mailed to a pro-Tibet mailing list three days ago. It looked like it was coming from the Unrepresented Nations and Peoples Organization (UNPO). However, the email headers were forged and the mail was coming from somewhere else altogether.

However, this is not a normal PDF document. It contains a modified version of a PDF-Encode vulnerability to exploit Adobe Acrobat when cá cược thể thao bet365_cách nạp tiền vào bet365_ đăng ký bet365 document is opened.

The exploit silen…