Saturday, February 25, 2012

Gaddafi explains the Internet to us



I know that this is old news, but I just came across Gadaffi's poetic description of the Internet last year. Worth a revisit if you're a despot or simply feeling a bit overwhelmed by lolcats:

This Internet, which any demented person, any drunk can get drunk and write in, do you believe it? The Internet is like a vacuum cleaner, it can suck anything. Any useless person; any liar; any drunkard; anyone under the influence; anyone high on drugs; can talk on the Internet, and you read what he writes and you believe it. This is talk which is for free. Shall we become the victims of "Facebook" and "Kleenex" and "YouTube"! Shall we become victims to tools they created so that they can laugh at our moods? We decide our destiny, based on facts and our needs. Besides, this is not the era of blood, of smoke, of burning, of knives and axes; this is the era of the people, and supposedly the era of democracy. Everything is by election and referendum, ie, through the people's direct authority, which is the people's direct democracy, and not through rumours, and Facebook, and YouTube, and the Kleenex and the cables of American Ambassadors. This world wide web Internet is laughing at us and damaging our countries; it is tearing up our clothes; and killing our children for it.

Friday, February 17, 2012

Where do Wikipedia edits come from?

Our team recently decided to look at the origins of edits to Wikipedia articles. The results are striking. But given what we already know about the uneven geographies of Wikipedia are perhaps not that shocking.


To make these maps we took quarterly data about the total number of edits (to all Wikipedia versions) to emerge from any territory (i.e. the amount of content that people are producing in each country) and averaged it over a two year period (2010-2011). The inequalities in the amount of content produced are stark: the US, Germany, the UK and France all have an average of over a million edits each quarter.

But then you get most of Africa and the Middle East where the average number of edits per quarter is only a few thousand. Interestingly, there are more edits than originate in Hong Kong each quarter than the entire continent of Africa.

Much of this variation can actually be explained by Internet population (i.e. the total number of Internet users in a country). However, even accounting for their generally low Internet populations, most countries in the MENA region and Sub-Saharan Africa still fall below their expected number of edits (we are currently working on some statistical models and writing a paper about this topic).


Zooming into the MENA region, the scale of some of these disparities become even more clear. In the map above, you can see that there are almost as many Wikipedia edits that come from Israel (215,333) as from the rest of the entire region combined (254,089)!


Finally, it might also make sense to look at the number of views per edit in the region. This gives us a sense of how much consumption vs. production of information is happening. Israel again stands out with fundamentally different characteristics from the rest of the region. In Israel their is much higher rate of information production on Wikipedia (as compared to number of views/consumption) than many of its neighbours. 

Libya and Iran also score well in this regard. In the case of Iran, we also see a lot of edits originating in the country (Iran has the second highest number of edits in the region). With Libya, it may also be the case that there is genuinely a high number of edits per views: but we may also be dealing with rounding errors given the very small number of both views and edits that we see in the country.

As always, there will be more maps and analysis up here soon. But I wanted to share these and ask for thoughts and comments as we start writing our paper.

Wednesday, February 15, 2012

My talk schedule at the New York AAG conference

The AAG Annual Meeting starts next week, and I wanted to quickly outline the talks and sessions I am presenting, organising, or co-authoring:

Feb 23rd: "Uneven Geographies of Knowledge: The Internet and the Need for Broader Participation" 5pm-8pm at the Development Geographies Specialty Group pre-conference.

Feb 24th: "The Technology of Religion: Mapping Religious Cyberscapes" 12:40pm - a paper that I have co-authored with Taylor Shelton and Matt Zook. Taylor will be presenting this one.

Feb 25th: "Augmented realities and uneven geographies: exploring the geolinguistic contours of the Web" 8:20am in the session on "Information Geographies" that I am also co-organising.

Feb 26th: "The First Annual Iron Sheep gathering/meeting/challenge"

Feb 28th: "Iron Sheep: An open session dedicated to lightning mapping and understanding VGI in the "wild"" 8:00am, and despite the fact that I'm a co-organiser, I'll unfortunately have to skip this one in order to get back to Oxford in time to teach my ICT and Development course (which starts an hour after this session ends!).


Monday, February 6, 2012

A critique of the Economist's "#AfricaTweets" story


The latest edition of the Economist contains an article titled “#AfricaTweets.” The piece contains a striking map that visualizes the “number of tweets” per country in the “top 20 African countries.”

The only problem is that the article doesn’t do what it promises. 

My problem with the Economist’s article isn’t their whimsical (and quite funny) commentary on the use of Twitter in Africa (e.g. they quote @MorganTsvangirai “******* **** ******* ****** ******** ****** ** ******* #ZimPolitics” and @Bono” Africans tweeting each other, not me, about news, not me #sadface”).

The issue is that the Economist makes no attempt whatsoever at qualifying the limitations of these data. 

For instance, the article begins with the statement that “Twenty countries sent over 11m tweets in the last quarter of 2011.” I believe this to be a vast underestimation of the amount of information pushed through the platform in Africa. 

Looking at the source document for the Economist’s data (something they neglect to link to), we see that the data naturally contain only geo-located Tweets (something they neglect to mention). This is important because only a very small proportion of tweets tend to contain any geodata. In June 2011, my team and I collected 19.6 billion tweets using the statuses/sample stream with spritzer access (this was a 19 day sample collecting both geocoded and non-geocoded tweets globally), and we found that only 0.7% of tweets contained geographic coordinates.

Original map created by Portland Communications above

This matters because it is conceivable that people in some countries are more likely to geolocate their tweets than others due to either social norms or access to the requisite devices (such as smartphones). In other words, by looking at geocoded tweets we're only seeing a tiny fraction of the content that passes through the platform.

This isn’t to say that there aren’t other ways to geolocate information on Twitter. In a recent paper, Scott Hale, Devin Gaffney and I recently analysed whether locations in user profiles (descriptions such as “Oxford, UK” or “Barad-dûr, Mordor, Middle Earth” that can be grabbed from the vast majority of tweets) can be used as a proxy for (much rarer) geocoded content. It turns out that profile information isn’t a great substitute for actual latitude and longitude coordinates.

Time zone settings are another approach to figuring out where information comes from, but our research shows that many users (especially within Africa) don’t seem to set their time zone.

Furthermore, there is no attempt to account for prolific users in these samples. Looking at the Economist’s map (and even the source document) doesn’t tell us if Gabon is in the top-20 list because a lot of people use the service in that country, or a small number of Twitter addicts all have their smartphone GPS buttons turned on. Knowing the answer to this question fundamentally changes how we should interpret the map. 

None of this means that the original maps produced by Portland Communications are fundamentally flawed. Geocoded tweets are both insightful and useful. It just shows that there are crucially important details to be aware of whenever analysing Twitter data (I won’t even get started on the different types of sampling methods in this post).  

Hundreds of millions of short messages are passed through Twitter every day, and this content has been used by researchers from fields as diverse as epidemiology, politics, marketing and geography to better understand, map and measure large-scale social, economic, and political trends and patterns. However, much of this analysis is carried out with only limited understandings of how best to work with the spatial and linguistic contexts in which that information was produced. 

Maps are powerful tools: they influence how we understand, enact, produce, and re-produce our world. This means that cartographers bear a significant amount of public responsibility. 

And any geographer will tell you that no map is true representation of anything. With the advent of easy-to-access Internet-based data, we therefore need to more than ever constantly ask critical questions about how online data are collected, analysed, and presented.

Wednesday, February 1, 2012


دعوة للمشاركة في ورشة عمل بخصوص ويكيبيديا



تسرنا دعوتكم لورشة عمل بخصوص ويكيبيديا لمدة يومين و تضم ثلة من الباحثين و ممثلي مؤسسة وكيميديا، نتبادل خلالها الأفكار و الخبرات حول ويكيبيديا بمشاركة خبراء و منتجين و مهتمين بشأن ويكيبيديا.
الغاية من هذه الورشة هي تبادل الآراء و فهم أهم العقبات و الحواجز التي تحول دون المشاركة في تطوير ويكيبيديا العربية.

للمشاركة يجب ان تتقن اللغة العربية أو الإنجليزية كما يجب أن تتوفر لديك أحد الشروط التالية:

أن تكون :

محررا لويكيبيديا العربية ،
محررا لويكيبيديا (أي لغة من اللغات) و تكتب مقالات حول الشرق الأوسط.
مترجما لمقالات بأي من اللغات التالية: العربية الفصحى، العربية باللهجة المصرية، الإنجليزية، الفرنسية، العبرية و الفارسية.
راغبا في المساهمة الفعالة بخصوص تطوير ويكيبيديا عربي.
عازما على تبادل أفكارك مع الحضور من خلال تقديم محاضرة بخصوص المواضيع التي من ضمنها

الصراع و التهميش على ويكيبيديا.

الحواجز التي تحول دون المشاركة في تطوير ويكيبيديا عربي.

استراتيجيات و أدوات التحايل على المشاركة في ويكيبيديا عربي.

منضمو الورشة:
الدكتور مارك قراهام، معهد الأنترنت بأكسفورد-المملكة المتحدة.
الدكتور برني هوقان، معهد الأنترنت بأكسفورد-المملكة المتحدة.
الدكتورة الهام العلاقي، الجامعة الأمريكية بالشارقة- الإمارات العربية المتحدة.

مكان الورشة: معهد الإعلام بالأردن - عمان.

تاريخ الورشة: 11 و 12 أبريل/ نيسان 2012.
يجدر العلم بتوفر منح المشاركة لدعم مصاريف السفر و الإقامة بمكان الورشة بعمان الأردن (حسب الحالات). كما يجدر التنويه بأن الأماكن محدودة، وعلى الراغبين في المشاركة أن يرسلوا طلب (صفحة واحدة) في أقرب وقت ممكن و قبل 10 مارس/ آذار 2012 يتضمن عرضا توضيحيا بخصوص مساهمتكم في اثراء ورشة العمل حسب ما تقدم ذكره.
للمزيد من المعلومات و التسجيل يرجى الاتصال بالدكتورة إلهام العلاقي عن طريق البريد الإلكتروني ilhemallagui@hotmail.com

Open invitation to a workshop in Amman: Middle Eastern Participation and Presence in Wikipedia



Your voice matters. Come and share your experience and opinions about Wikipedia with other Wikipedians, wiki producers, researchers, and representatives from the Wikimedia Foundation during a two-day workshop.

The goal of the workshop is to talk about and understand the most significant barriers to participation in Wikipedia in the Middle East and North Africa. As such, we would love to hear from you if you meet any of the following criteria:
  • A Wikipedian who edits Arabic Wikipedia
  • A Wikipedian who edits Wikipedia (in any languages) on articles about the Middle East
  • Someone who translates articles between any of the following language versions in Wikipedia: Arabic, Egyptian Arabic, English, French, Hebrew, Persian.
  • Someone who is eager to get more involved with the project, and would like to meet people with similar ambitions.
  • Someone that would like to give a short talk or presentation to other Wikipedians from the region (e.g. about conflict or marginalization, barriers to participation, and circumvention strategies and tools).
The workshop will have limited space available, so we ask everyone to submit a one page letter detailing why your participation will benefit Wikipedia, the goals of the workshop, and your personal development as a contributor to Wikipedia. 

Sessions and conversations will be held simultaneously in Arabic and English, and you will only need to be fluent in one of these languages to participate. 

In order to facilitate participation, we have a small number of scholarships available that will support travel to (and in some cases accommodation in) Amman.

Please email Dr. Ilhem Allagui at ilhemallagui@hotmail.com and express your interest in joining this workshop. Please discuss your experience and how involved are you with Arabic Wikipedia, you may be eligible to a travel grant to attend this workshop. 

Workshop location: Jordan Media Institute- Amman, Jordan
Workshop dates: April 11-12, 2012
More information about this project at: http://www.oii.ox.ac.uk/research/projects/?id=70

Workshop organisers:
Mark Graham (University of Oxford)
Bernie Hogan (University of Oxford)
Ilhem Allagui (American University of Sharjah)