I’m a little intoxicated, I’m not gonna lie. So what if it’s not even 10PM and it’s a Tuesday night? The Kirkland facebook is open on my desktop and some of these people have pretty horrendous facebook pics.


Billy Olson’s sitting here and had the idea of putting some of these next to pictures of farm animals and have people vote on who’s hotter.


Yea, it’s on. I’m not gonna do the farm animals but I like the idea of comparing two people together. It gives the whole thing a very “Turing” feel since people’s ratings of the pictures—


--will be more implicit than, say, choosing a number to represent each person’s hotness like they do on hotornot.com. The first thing we’re going to need is a lot of pictures.


Unfortunately, Harvard doesn’t keep a public centralized facebook so I’m going to have to get all the images from the individual houses that people are in. Let the hacking begin.



First up is Kirkland. They keep everything open and allow indexes in their Apache configuration, so a little WGET magic is all that’s necessary to download the entire Kirkland facebook. Kids’ stuff.

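First up is Kirkland House. Think of "Apache" as web server software: the program that delivers a site's pages to visitors. The quote says they "allow indexes in their Apache configuration"; depending on how Apache is configured, anyone can end up seeing a listing (an index) of the files stored in a site's directories.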



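What Mark is doing here is really simple: Kirkland House leaves its student information open to anyone, so he ran a command called WGET (wget) against the site and downloaded the face photos. Easy, right? Mark himself calls it "Kids' stuff."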

Next is Elliot. They’re also open but with no indexes on Apache. I can run an empty search and it returns all of the images in the database in a single page. Then I can save the page and Mozilla will save all the images for me. Excellent. Moving right along.



Lowell has some security. They require a username/password combo and I’m going to go ahead and say they don’t have access to main FAS user database, so they have no way of detecting an intrusion.


Adams has no security but limits the number of results to twenty a page. All I need to do is break out the same script I used on Lowell and we’re set.


Quincy has no online facebook, what a sham. Nothing I can do about that.


Dunster is intense. Not only is there no public directory, but there’s no directory at all. You have to do searches, and if your search returns more than 20 matches, nothing gets returned. And once you do get results, they don’t link directly to the images; they link to a php that redirects or something. Weird. This may be difficult. I’ll come back later.





Leverett is a little better. They still make you search, but you can do an empty search and get links to pages with every student’s picture. It’s slightly obnoxious that they only let you view one picture at a time, and there’s no way I’m going to go to 500 pages to download pics one at a time, so it’s definitely necessary to break out emacs and modify that perl script. This time it’s going to look at the directory and figure out what pages it needs to go to by finding links with regexes. Then it’ll just go to all of the pages it found links to and jack the images from them. It’s taking a few tries to compile the script…another Beck’s is in order.
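To make the "finding links with regexes" step concrete, here is a minimal sketch in Python rather than Mark's Perl (his actual script is not shown anywhere in the source). The sample page, its file names, and the `find_links` helper are all invented for illustration, and the code only searches a string held in memory.

```python
import re

# Hypothetical directory page; the markup and file names are made up.
SAMPLE_PAGE = """
<html><body>
<a href="student.php?id=1">Student 1</a>
<a href="student.php?id=2">Student 2</a>
<a href='about.html'>About</a>
</body></html>
"""

# Capture the value of every href attribute, single or double quoted.
HREF_PATTERN = re.compile(r"""href\s*=\s*["']([^"']+)["']""", re.IGNORECASE)


def find_links(html):
    """Return the href values found in the given HTML text, in order."""
    return HREF_PATTERN.findall(html)


links = find_links(SAMPLE_PAGE)
print(links)
```

A regex like this is fine for a quick one-off, but it is fragile against unusual markup; a real HTML parser would be the more robust choice.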



"break out emacs and modify that perl script"

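
The quote says "by finding links with regexes." "Regexes" is short for "regular expressions," a notation for describing patterns in text. For example, in a regular expression a dot (.) means "any single character," so "a", "b", and "c" all match a dot (.).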

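Here is a small sketch of the "dot matches any single character" rule, using Python's standard `re` module. The test strings are arbitrary examples, and `matches_single_char` is just an illustrative helper name.

```python
import re

# "." matches any one character (except a newline, by default).
# "^" and "$" pin the match to the start and end of the string,
# so this pattern accepts strings that are exactly one character long.
SINGLE_CHAR = re.compile(r"^.$")


def matches_single_char(text):
    """Return True if the text is exactly one character that "." accepts."""
    return SINGLE_CHAR.match(text) is not None


for candidate in ["a", "b", "c", "ab", ""]:
    print(repr(candidate), matches_single_char(candidate))
```

"a", "b", and "c" each match, while "ab" (two characters) and the empty string do not.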

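The last line, "It’s taking a few tries to compile the script", means that he wrote a program in the Perl programming language, but it had mistakes in it, so it did not work when he ran it. In other words, he fixed it a few times, ran it again, and then it worked.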

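emacs: a text editor

perl: a programming language

regexes: regular expressions

compile: to "translate" a program you have written. Computers ultimately understand only binary machine instructions, so the process of translating human-written code into a form the computer can execute is called compiling. (Perl does this step automatically each time a script is run, which is why a typo shows up as a failed compile.)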
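To see what "a few tries to compile" looks like in practice, here is a rough sketch using Python's built-in `compile()` function as a stand-in for Perl's compile step. The `check_syntax` helper and the two sample snippets are made up for illustration.

```python
def check_syntax(source):
    """Try to compile the source text without running it."""
    try:
        compile(source, "<example>", "exec")
    except SyntaxError as err:
        # The translation step failed, so the program never starts.
        return f"syntax error: {err.msg}"
    return "ok"


broken = "print('hello'"   # missing closing parenthesis
fixed = "print('hello')"   # the corrected version

print(check_syntax(broken))
print(check_syntax(fixed))
```

The first attempt is rejected before any code runs; after the fix, the same check passes, which is the edit-and-retry loop the quote describes.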

Mather is basically the same as Leverett, except they break their directory down into classes. There aren’t any freshmen in their facebook…how weak.