Sunday, 20 November 2011

Controlling the Web


Google lawyers explain how the financial blockade of Wikileaks could apply to anyone under SOPA.

Documents obtained by Wall St Journal open window into a new global market for off-the-shelf surveillance technology

UK Internet Censors
Blacklist Fileserve File-Hosting Service

HP Computers Underpin Syrian Surveillance

When you request your personal data, Facebook generously keeps most of it so it doesn't overwhelm you.

The Internet gold rush: why your data is valuable.

A secure Internet can save people's lives.

The Internet can recognise your face.

How does Twitter choose trending topics?

Who controls the Internet?

Who controls Asian cyberspace?

Introduction to Parallel Programming and MapReduce

This may, for some of you, be helpful background for the lecture on Indexing the Web.

Google Code: Introduction to Parallel Programming and MapReduce

Wednesday, 16 November 2011

Eliza lab

Today you'll experiment with using regular expressions to mimic Eliza's responses. Using this tool, type in search patterns to match the kinds of sentences that people often type as input to Eliza. Then in the replacement pattern box, write a pattern to produce an Eliza-like response. For example, the search pattern
  • I am (\b\w+\b)
with the replacement pattern
  • How long have you been $1?
will match the sentence
  • I am bored
typed in the input text box and produce
  • How long have you been bored?
in the output text box.
So, now you try it. Write a regular expression search pattern to match sentences of the form:
  • I am sick
  • I am tired
  • I am hungry
Does your search pattern match the following strings and produce the kind of response you would expect?
  • Today I am bored
  • I feel very happy
  • I feel awfully tired
If not, fix your search pattern so that the system generates sensible outputs for each of these. Now try these
  • I feel cold and I want a nap
  • I feel cold and I want a hot drink
  • I like food and drink
  • I like to sing and dance
What if you wanted to generate the response:
  • Have you always enjoyed eating and drinking
in response to the input:
  • I like to eat and drink
Would your search pattern also work for
  • I like to sing and dance
Now write a search pattern that will match sentences like:
  • I am tired and hungry
and produce the response:
  • What do you think makes you hungry and tired?
Now play with this version of Eliza. Find cases where it doesn't respond very naturally and try and write search and replacement patterns that solve the problems you encounter. (Note: they may not all be solvable just by this simple search and replace method.)

Monday, 14 November 2011

More links on SOPA and Protect-IP

SOPA stands for the Stop Online Piracy Act. It is one of two bills being considered by the US congress. The other is the Protect-IP Act.

Note that in a topical and contentious area such as this, Wipipedia may not reflect a settled consensus. It is useful, and sometimes illuminating, to look at the page history to see the "edit wars" that erupt.