Wednesday, May 23, 2012

Adieu French: comparing English and French Wikipedias

The English and French Wikipedias are the world's first and third largest versions of the encyclopedia (containing 3.9 and 1.3 million articles respectively).   

I thought that it might be instructive to compare the geographic coverage of the two. Even though there is three times as much content in English than French, one might assume that there are plenty of parts of the world in which people are more likely to annotate or augment space with French content. 

The results are contained in the map below:

We ultimately see only a few countries in which there is more French content: France (of course), Belgium, Luxembourg, the Francophone parts of the Maghreb (Algeria, Morocco, and Tunisia), the DRC, Senegal, and surprisingly Bosnia, Montenegro, and Kosovo.

You would expect the first eight countries on the list to have more French content than English, but there seems to be no obvious reason why Bosnia, Montenegro, and Kosovo have more French-language information about them. Then again, there is not necessarily a reason why there should be more English-language content in every other country in which neither French nor English is the primary language spoken.

Also interesting is that much of the rest of the Francophone world has more English-language content then French. Madagascar, Haiti, Cameroon, Mali etc. all have more written about them in English than French.

What does this map tell us? We know that the number of Wikipedia articles about a place isn't necessarily a great proxy for broader social or cultural relationships and patterns (e.g. the example of the heavy focus on Turkey in the Swahili Wikipedia). But perhaps these patterns of attention do still tell us something about the importance of English vs. French in some of these places. Rwanda, for instance, has more English-language content: a fact that reflects the country's shift into the Anglophone sphere.

Perhaps in much of the rest of the Francophone world we are also seeing a similar (although likely less-pronounced) shift towards use of English as a means of non-local communication and local representation to a broader audience.

I'd welcome any further thoughts or questions....

(for more information about this work, have a look at the other blog posts I've written about the geographies of Wikipedia)

Friday, May 18, 2012

New paper - "Thai Silk dot Com: Authenticity, Altruism, Modernity and Markets in the Thai Silk Industry"

I've just had a paper accepted to Globalizations: 

Graham, M. (2013) Thai Silk dot Com: Authenticity, Altruism, Modernity and Markets in the Thai Silk Industry. Globalizations 10(2).

The abstract is below and you can download a pre-publication version here.

The production of silk occupies a unique place in Thai cultural and economic practices. However, the practice is rarely passed on to the younger generation and is widely considered to be a dying craft. In response, influential organizations have proposed use of the internet as a way to reinvigorate the industry and attract new customers. This paper looks at the discourses used to sell silk and the ways that sellers are either framing Thai silk as a traditional craft in need of saving or as an enterprise that efficiently engage with the commercial needs of the global economy. The paper reviews the range of, often problematic, emotions, images and associations used to sell a dying craft. Ultimately, it argues that, in contrast to many of the theorized effects of the internet, it seems to be neither encouraging a mass homogenization or pushing sellers to effectively integrate themselves into global markets.

And a few random pictures from my fieldwork in Thailand: 







Saturday, May 12, 2012

Mapping Wikipedia edits from Europe

I'm still playing with our database of Wikipedia edits (which tells us how many contributions to the encyclopedia originate in each country) and made two more maps of Europe. The height of each country represents the number of edits originating in that place. The shading indicated the number of edits per Internet user (darker reds meaning higher per capita participation). 



We see that the majority of edits in Europe come from Germany, the UK, France and Italy. Other countries in Europe contribute far less to Wikipedia.

Interestingly though, Germany and UK have fairly low participation rates when normalised by Internet population. Internet users in Italy, Scandinavia, the Baltic States, and even Ukraine are more likely to make an edit to Wikipedia than their British or German counterparts.

Also notable are the relatively low (total and relative) participation rates from Portugal and Poland.

If you're interested in Wikipedia edit data, take a look at the other regional maps that I've made:
Middle East and North Africa
South America
Africa
Global

Tuesday, May 8, 2012

Hiring a part-time research assistant to do statistical, spatial, and social analysis

Bernie Hogan and I are hiring a part-time Research Assistant to carry out research into the geography and social structure of Wikipedia in the Middle East and North Africa through large-scale data analysis. The position will involve the analysis of the corpus of Wikipedia text, user-pages and history files and the use of statistical techniques to explain spatial and social patterns. Our research question focuses on patterns of representation on Wikipedia as well as an articulation of patterns of conflict and barriers to participation.

The successful candidate will manage and perform queries on a large database, statistically and geographically analyse and visualise results, explore alternate methods to answer the project's core research questions, and assist in writing academic papers and technical reports.

Essential attributes:
• A graduate degree or postgraduate training in quantitative social science. Preference will be given to candidates in geography or sociology;
• Experience with statistical modeling, particularly regression analysis;
• Experience working with databases and large datasets (i.e. N > 1 million);
• Familiarity with GIS software;
• Familiarity with social network analysis software;
• Ability to work autonomously and be creative in the ways that you answer research questions.

Desirable attributes: 
• Experience visualising statistical, social networks and geographic data;
• Experience with text mining;
• Experience writing for an academic audience (i.e. journal articles and book chapters);
• Interest in and enthusiasm for the work of the OII;
• Experience working with publicly available secondary datasets.

The deadline for applications is June 1. Please get in touch if you have any questions.

(link to apply is here)

Links to our relevant Wikipedia projects:
http://www.oii.ox.ac.uk/research/projects/?id=66
http://www.oii.ox.ac.uk/research/projects/?id=70

Random map from the project:

Friday, May 4, 2012

Augmented information and the reproduction of visibility

I spend a lot of time thinking about the geographies of information that augment our planet (e.g. see a paper on augmented realities and uneven geographies that I recently wrote with Matt Zook). And many people ask me why the layers of information about place really matter. Who cares if London is covered by a denser cloud of information than Lagos? Why does it matter if some places are digitally mirrored by content that omits the voices of women, minorities, the oppressed, the invisible? Why do these augmentations really matter?


It is sometimes hard to come up with tangible examples of how the layers of information over place do truly matter. So, I want to share a quick story from an interview that I conducted in Kenya last week as part of our project about how altered communications capabilities are changing East Africa. A travel agent in Nairobi was telling me how the faster internet that he now has allows him to offer more regional tours to nearby countries like Rwanda and Uganda.

I then asked for more details about what this specifically meant. What is it that he is doing now that he couldn't before. The answer was 'Googling many more things - faster - more efficiently.' But I pressed the issue, what did this mean? What information was Google giving him that allowed him to learn more about new places and offer them as destinations to his clients. Well, the answer was actually Wikipedia. Google would lead him to relevant Wikipedia articles, and he would read about national parks, adventure activities and a range of other sights and then relay that information to his customers.

This may seem like a relatively trivial or uninteresting story. However, I think it nicely illustrates how geographic representations don't just influence how we think about places, but also, in a very real sense, influence how we move through, interact with, and enact place. 


(we also have a second paper on augmented realities that expands on some of these themes in more detail. I'd be happy to share a pre-publication draft if you'd like to get in touch)

Monday, April 30, 2012

Mapping #kony2012 on Twitter (part 2)

Following on from my last post about mapping #kony2012 on Twitter, I also wanted to offer up a map that shows the proportion of tweets from each country that made reference to the viral video or the LRA leader (or both).


We've already seen that most tweets referencing Kony were published from North America, and Western Europe, but this map tells a slightly different story. We now see that despite the very low total amount of content coming out of Sub-Saharan Africa (see the map below or read this post about where tweets come from), there is indeed a significant amount of focus on the Kony issue there.

But it remains that this is still a discussion that was largely taking place online in the Global North. While the top five countries in our sample (20% of all georeferenced tweets between March 5 and March 13, 2012) are: (1) USA; (2) Brasil; (3) Indonesia; (4) UK; (5) Mexico; and (6) Malaysia, the top five in terms of tweets that reference #kony are: (1)USA; (2) UK; (3) Canada; (4) Brasil; and (5) Ireland.

In other words, we see that while Twitter itself contains a lot of voices from the South, the discussion of Kony and Invisible Children remained firmly in the North.

***
Below I am also including a couple of maps showing the total number of tweets published during the same period:

Thursday, April 26, 2012

Mapping Wikipedia edits from South America

If you've ever wondered where edits to Wikipedia come from in South America, the map below might be useful to you. 

The answer is that almost half of all edits to Wikipedia from South America come from Brazil. These data are actually not that surprising considering that there are almost 80 million Internet users in the country (and Brazil is home to about 45% of all of South America's connected population).

Chile is the country that scores a lot better than would be expected. The country is home to only about 5-6% of the continent's Internet population, but contributes over 12% of Wikipedia edits.

I've finally merged the Wikipedia data with Internet penetration data, so will be creating maps normalised by each country's connected population as soon as I get a few more free hours.

related:

Monday, April 23, 2012

The geolinguistic contours of digital content in Spain

Following up on our post about augmented realities and uneven geographies, we wanted to post a few more maps that came out of the project.

This first one compares content indexed (by Google Maps) in Spanish (Castilian) to content in Catalan. Throughout much of the Catalonian region in the Northeast coastal areas there is considerable more content in Catalan than in Spanish.

The second compares content containing the word "love" in English and Spanish. The map reveals that while the Spanish term is much more predominant overall, there are clusters of locations along the Mediterranean coast at which there are more references to the English word.

These agglomerations are centered in tourism regions of Costa Brava, Costa Blanca, and the Andalusian coastline and closer inspection reveals that these concentration of hits are tied primarily to tourism related references to hotels, restaurants and other activities that are target to non-Spanish visitors.

One key thing that this map does then is reveal how the audiencing of augmentations can be alternately directed to a range of groups: ranging from the highly local (e.g. interpersonal relationships), to the global (e.g. tourist sites).

You can read more about the methods we used and our full conclusions in our new paper: "Augmented Realities and Uneven Geographies: Exploring the Geo-linguistic Contours of the Web."

Friday, April 20, 2012

O Mundo Pela Wikipédia


Some work done by Monica Stephens, Scott Hale and myself just got picked up by the Brazilian magazine Exame.


The spread offers an alternate visualisation to the data that we're collecting about the geographies of Wikipedia. It also includes penguins. I don't speak Portuguese, so am not sure what the penguins have to do with Wikipedia. But, being someone surrounded by virtual sheep, who am I to talk.

A PDF of the piece is here.

Monday, April 16, 2012

Jolie Holland - Catalpa Waltz (trying out Spotify's play button)

This is just a quick post to test out Spotify's new play button functionality. Spotify, more than any other online tool, file format, or technology, has fundamentally changed how I interact with music. Exploration and discovery become not just possible, but encouraged. And despite the fact that I used to both run an online radio station and work as a DJ at my college radio station, I'm pretty sure i've discovered more new (and brilliant) music in the last few years with Spotify's streaming service than at any other point in my life.

Anyway, if this play button gadget works well, I might try to share some playlists in the future.



(I've picked this track because I just discovered Jolie Holland's Catalpa a few days ago and haven't been able to stop listening to it since then)