All posts by Nicholas Reith

A Sociology PhD Student at the University of Texas at Austin, Nicholas is interested in comparative international sociology focusing on issues of migration, development and religion.

Screw the Models: A Talk on Data Dilemmas with Professor Alex Weinreb

Last week on Wednesday, 9 November, Professor Alex Weinreb gave a fascinating talk to an audience of graduate students and professors from the Department of Sociology here at UT Austin.

Professor Weinreb’s talk, entitled “Screw the Models, get back to the data: Or, on the disciplinary dangers of data ex nihilo” comes out of research he has been doing for his current book project on the mis-measurement of society.

The basic premise of Weinreb’s talk, which has potentially earth-shattering implications for the positivistic social sciences, is that there are manifold errors at the basic level of data collection, particularly in the third world, which may lead to mistaken results in quantitative studies that rely on survey-based research.

Weinreb contends that the past four or five decades of quantitative research have witnessed an impressive growth in the complexity of statistical modeling techniques, and in particular post-facto techniques for “data cleaning” that attempt to fix problems in survey data prior to analysis. Yet, since the 1950s, there has been little social science research aimed at assessing survey errors in the third world and finding better ways to get more accurate data.

He gave several shocking examples of how survey research has missed big conclusions due to data issues, despite fancy models. First, heavy-weight demographers in the 1980s completely missed the process of African fertility transitions, which was happening as they wrote, because their data were not adequate to the task they proposed. Second, a multi-million dollar cross-national survey of several Asian countries in the 1990s was unable to find any significant results regarding women’s autonomy, which contrary to theoretical assumptions seemed to be greater in patrilineal and patriarchal areas than in others.

In sum, whether social scientists miss something big, or are looking for something big and can’t measure it, the answers to both of these errors lie in the data, rather than in more complex models. And the essence of these errors lie in “non-sampling error”. In other words, even where sampling is perfectly random, or adjusted with appropriate weights, a number of other errors continue to creep into datasets. The shocking thing is that non-sampling error accounts for the majority of variance in most data.

Some types of errors that are commonly seen in third world surveys include variations in results due to translation issues, insider-outsider dynamics, male-female interviewer-interviewee dynamics, and privacy issues (whether or not a third party is present during the interview).  Some of Weinreb’s recent work in the Dominican Republic has shown the variation in results when interviewers know the respondent personally, are an unknown community insider, or are an outsider to the community. In one illustrative example, respondents were much more likely to falsely claim to know a fictitious person among a list of real personnages when interviewed by an outsider, something Anthropologists and ethnographers call “sucker bias.” But the problem is that the directionality of any such bias is not consistent. On one set of questions respondents may be more likely to give accurate answers to outsiders than insiders (or to men rather than women), and vice versa for another set. And in another country or part of the same country, or even among different gender and age groups, this situation may be completely different.

There are clearly no easy solutions for these data dilemmas. But further research into data collection methods is one key to improving results. Although there have been advances in data collection methods in the developed world, with regards to the third world, this type of research has stalled since the 1950s and 1960s after World Fertility Surveys in the 1970s, and later Demographic & Health Surveys from the 1980s until today became the nearly universal gold standard for survey research. Having one such standard method for survey research does have the benefit of comparability across place and time. However, it seems to have serious problems of accuracy since it does not get the best results in all contexts. In other words, the current position of the social science academy is to privilege reliability of data over validity for a number of reasons, including comparability and tradition.

Following his talk, a number of graduate students and professors engaged in a lively discussion with Weinreb, parsing out the details of some of his broader brush strokes, and debating the pro and contra of some of his hypotheses. Department Chair and Professor Christine Williams pointed out that many of these data dilemmas were critiques often leveled by qualitative researchers at quantitative research in general, but noted that it was important to see this type of nuanced discussion from within a quantitative framework, since it is essential to always improve all of our research methods. Professor Pam Paxton, alluded to the conversation from the week before at the brown bag presentations of graduate students Amanda Stevenson and Isaac Sasson, pointing out that there are a number of solutions to data problems, even issues of non-sampling error, if researchers take the time to thoroughly diagnose problems and deal with them. She also pointed out that there is an accountability mechanism built into this type of survey research in the form of policies and programs, which are often informed by such social science research and ultimately prove successful or unsuccessful.

Isaac Sasson turned the discussion toward the future of the field and wondered aloud about how these ideas affect the big picture of the knowledge of structure and the structure of knowledge. And graduate student Marcos Perez ruminated about interdisciplinary linkages and the ways in which some of these issues could be solved through collaboration with colleagues in other departments.

Although we don’t yet have the answers to a number of these questions, Professor Weinreb’s work did shed some light on the problems of data assumptions ex nihilo (out of nothing), which ignore non-sampling error. Given the entrenched nature of quantitative research traditions, this is likely to be but a quiet revolution in the social sciences in the immediate future, but a revolution nonetheless, and one to keep our eyes on.

State-Building and Property Regimes in Africa: A Talk with Catherine Boone of the UT Department of Government

Professor Catherine Boone of the Department of Government at UT-Austin spoke to an audience of graduate students and professors from Sociology, Government and other disciplines recently about her latest research on territorial politics and rural property regimes in contemporary Africa.  The talk, which took place on Friday, October 21st 2011, was the latest event organized by Power, History, and Society (PHS), a faculty-student network at UT, founded by graduate students and faculty in the Department of Sociology and  led by Professor Maya Charrad.  Sociology graduate student Christine Wheatley served as discussant, responding briefly to Professor Boone’s talk and setting the stage for the Q & A that followed. Other graduate student members of the PHS network assisted in organizing the talk, including Nicolette Manglos, Nicholas Reith, and Julie Beicken.

Professor Boone began her talk by giving a brief overview of the history of political science and political sociology scholarship on Africa.  Since structuralism’s apex in the 1970s and 1980s its influence within research on politics has declined.  This, coupled with the complexities of African societies, led many to conclude that there was little “structure” to be found on the African continent characterized by fictive “free peasants.”  In this context, where over 70 percentof Sub-Saharan Africans still live in rural areas, our scholarly understanding of rural social structures and agrarian-state relations is still woefully incomplete.  Those most aware of the complexities and intricacies of African societies have been anthropologists, who are often anathema to structuralist approaches.   Others have continued to spin behaviorist, voluntarist, culturalist and neo-patrimonial theories of African politics.

Thus, in her latest book project, Professor Boone attempts to make a structuralist and institutionalist argument about the ways that certain land tenure regimes, which govern access to land and vary across national and sub-national spaces, can have stark political effects on the scale and scope of political conflicts.  She proposes a typology of land tenure regimes that consists of three distinctive forms: familial land holding, local and regional chieftaincies, and statist regimes, where direct agents of the central state control land allocation.  These variations in land tenure regimes, she argues, produce two important and related political effects. First, they influence the scale and scope of redistributive conflicts around land. For example, if land allocation is controlled by family/lineage, then disenfranchised persons must limit their grievances to the family.  Second, they produce geographic unevenness in local possibilities for national citizenship, political voice in the national arena, and liberal democratic representation at the national level. Thus, if conflicts are limited in scale and scope beneath the national level, so is political participation of citizens involved in the conflict at the national level.

This comparative and structuralist argument is timely and has broad implications beyond explaining current political conflicts in Africa.  In the context of globalization and the neo-liberal pressure for the state to retreat from arenas it once controlled, it sheds light on the various effects of decentralization, helping us to compare and contrast its political, social, and economic costs and benefits.  It also shows how less centralized rural property regimes– whether at the level of extended family/lineage or chieftaincies, while problematic in several ways, seem to serve as the last line of defense against the looming threat of land dispossession by the global market.  As land values rise and there is increased pressure from international buyers, a centralized state system of land tenure may make the market so “efficient” as to make it even easier to drive current inhabitants off of the land that they have long occupied.  It also broadens the potential for wider, national level conflicts.

Sociology Professors Alex Weinreb and Nestor Rodriguez as well as a number of graduate students from the Departments of Sociology and Government and other departments engaged Boone in a spirited discussion and debate inspired by her research.  The discussion revolved around comparative methodology, as well as the question of critical historical antecedents, in particular, the question of how far back in history it is necessary to engage in order to make an argument of structural causation.  Other lucid comments focused on comparisons between pre-capitalist Europe and Africa, which clearly differ in numerous ways, yet lend themselves to structuralist arguments of different types because of similar historical processes of land enclosure and dispossession.

Both the Power, History and Society Network and the Department of Sociology would like to thank Professor Boone for her excellent and engaging talk.  And we look forward to future cross-departmental exchanges and PHS events with others.

“Epidemiography”… Say What?

Sociologists have long been familiar with metaphors, analogies, and theories imported to the social realm from the natural sciences. One of our founders Durkheim viewed society as a living “social organism.”

Fast forward to 2011 where the proliferation of new communications technologies such as Twitter have once again spawned (pardon my biological lingo) new comparisons with the biological realm.

In his recent blog post, Anthropologist John Postill expounds upon some of the basic ideas of his new book Democracy in the age of viral reality: a media epidemiography of Spain’s indignados movement, which we may find applicable to the socially networked social movements/protests/revolts/revolutions happening around the world.

Many of us have already become familiar with the term “viral” when referring to the way comic and political videos quickly become popular on youtube. But Postill applies this analogy to social protests, believing that social media have been a game changer that created a “new media ecology” that foments “outbreaks” of this virus of opposition that regimes in the Arab world and even governments in the developed world have found difficult to “quarantine.”

Contrast this media ecology view with the more material perspective of the Latin American intellectual and journalist Raúl Zibechi who was quoted here as saying:

I don’t believe in virtual spaces, spaces are always material as well as symbolic. It’s another matter to speak of virtual media of communication among people in movement….

This raises the question of the material cost of protest and whether online social media actually constitute a “space” for protest, or rather are simply the latest tool in a long line of printing presses, telegraph machines, telephones, radios, fax machines, cassette tapes, satellite television, and other media used to spread messages of dissent and mobilize protesters and revolutionaries.

Ithiel de Sola Pool seemed to lean more toward the latter in his famous book, Technologies of Freedom.

Is becoming a facebook fan of a protest page anything other than simply a convenient barometer for public sentiment? Is it a viral outbreak of resistance that must be “quarantined”, or is armchair revolution simply not revolution?

In the Egyptian protests last January, Mohammed Bamyeh notes here that some of the most critical moments that led to the downfall of Hosni Mubarak came during the period when the government had actually shut down all internet in the country, and protesters used more traditional means to organize and still were able to have millions protesting around the country.

Contrast this with the “viral” Occupy Wall Street movement, which uses the slogan “We are the 99%”, but is only able to get a thousand or so at a time to show up, even with the full force of social media, the organization of volunteers providing food, and the attraction of performers coming to put on concerts for the protesters.

In the end of the day, is the difference between Tahrir Square and Zucotti Park, even after more than 40 years of authoritarian rule in Egypt, more about family, neighborhood and community than it is about technology? Is it possible to organize a mass social movement in a society that is “bowling alone”?

STATA Geek Out – Tables with outreg2

Aside from the very interesting theoretical and political-sociology oriented posts of late, some of us at the UT Austin Soc blog would also like to encourage other types of posts with a more methodological angle.

Since many of us use STATA for statistical work, I thought a series of posts on STATA tips and tricks would be a good place to start our “geek out” and share some time-saving, or just plain cool commands.

So here I’m going to give a little bit of sample code for getting tables and graphs out of STATA for a more manageable look at results.

TABLES

When it comes to tables, there are a number of useful programs built in to STATA to export results of regressions and other data. STATA 12 now comes with an improved menu button for exporting certain parts of the raw data to excel. “Tabout” is a useful tool for creating summary excel tables of tabbed data, for example average income by gender, if the data is from a certain country.

But many times it is not simple data or tabs that we want to see in excel, but rather more complex results of our regressions. Our eyes can only take so much of staring at the output window and it is hard to make connections without seeing things in neat tables. Sometimes we even need to create publication quality tables to insert into articles.

Now, copying and pasting and formatting by hand is always an option. But over the course of just one project, not to mention an entire PhD program, the countless hours spent making revisions by hand until perfection seem to justify the short-term time-expenditure on learning how to automate tables in your STATA code.

For this purpose then, there are two excellent little programs called “estout” and “outreg”. “estout” enables you to output a specific set of regression (or other analysis) results after first saving them with “esttab.” “outreg” is a bit more automated, and in fact my favorite, which I will demonstrate here is “outreg2”, which has more bells and whistles and seems to work well even with more advanced models beyond simple regression.

For this purpose, I will use a simple country level data set collected from the World Bank website, which includes three variables: 1) country, 2) hiv (average hiv infection rate for the past 5 years), and 3) pov (average poverty rate for the past five years).

DISCLAIMER: I came across the question of whether poverty has a significant effect on HIV infection rates in some development literature, much of which assumes that the two are linked. However, the jury is still out and this simple regression exercise does not in any way claim to offer answers. Rather it aims only to demonstrate some techniques for data analysis in STATA. Statistics probably can tell us something about this question, but for that, a much more complicated model would be appropriate.

So, there are several steps to get our neat excel output. (In case you are totally new to STATA, note that the actual code you type comes on the lines below that begin with periods, and what follows is the output.)

First, we run our basic regression:

. reg hiv pov

      Source |       SS       df       MS              Number of obs =      61
-------------+------------------------------           F(  1,    59) =    0.76
       Model |  19.0177362     1  19.0177362           Prob > F      =  0.3873
    Residual |  1478.91636    59  25.0663791           R-squared     =  0.0127
-------------+------------------------------           Adj R-squared = -0.0040
       Total |   1497.9341    60  24.9655684           Root MSE      =  5.0066

------------------------------------------------------------------------------
         hiv |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         pov |  -.0190502   .0218708    -0.87   0.387    -.0628136    .0247133
       _cons |     3.1443    .970026     3.24   0.002     1.203282    5.085317
------------------------------------------------------------------------------

Next, we can call up a list of the estimates obtained from the above regression.

. ereturn list

scalars:
                  e(N) =  61
               e(df_m) =  1
               e(df_r) =  59
                  e(F) =  .7586949891917172
                 e(r2) =  .0126959765324349
               e(rmse) =  5.006633506264527
                e(mss) =  19.01773619459277
                e(rss) =  1478.916364896987
               e(r2_a) =  -.0040379899670153
                 e(ll) =  -183.795077505511
               e(ll_0) =  -184.1847839095903
               e(rank) =  2

macros:
            e(cmdline) : "regress hiv pov"
              e(title) : "Linear regression"
          e(marginsok) : "XB default"
                e(vce) : "ols"
             e(depvar) : "hiv"
                e(cmd) : "regress"
         e(properties) : "b V"
            e(predict) : "regres_p"
              e(model) : "ols"
          e(estat_cmd) : "regress_estat"

matrices:
                  e(b) :  1 x 2
                  e(V) :  2 x 2

functions:
             e(sample)

With these estimates we can use outreg2 to create a simple table.

. outreg2 using OUTPUT_hiv_pov_countries, e(N df_m F rss ll) excel replace
OUTPUT_hiv_pov_countries.xml
dir : seeout

So, it’s as simple as that. Just run your analysis, call up the list of estimates, and plug those in to have outreg2 create a nice excel table like the one below. As you can see, one cool feature of outreg2 is that it automatically adds 1, 2 or 3 stars to your estimates in order to indicate whether they are statistically significant at the .1, .05, or .01 levels. This is not only a publication convention, but is also very useful for a quick eyeball look at your results, to see if you are on the right track.

NOTE ON AUTOMATION:

When you are running a large number of analyses however, it is useful to note a few things about automating outreg2.

1) Advanced formatting: Type “help outreg2” and take a closer look at the advanced features in order to be able to play with the formatting. This can save you from having to format every excel table by hand.

2) Replace: When running a number of analyses, for example the same regression over and over on individual countries, or separately for men and women… using the “replace” option on the very first analysis will make sure that you save over the old versions of your excel file when you re-run your code with the latest tweaks.

3) Append: Using the append option for your outreg2 code after each additional analysis that you wish to include in the same excel file will ensure that you have one big comparable table, which will list results for other regressions right along side the first one.

This was just a simple example to show outreg2 in action. However, one nice thing about this command is that it works well with more advanced analyses as well, including multi-level models, and can give you additional statistics such as Inter-Class Correlations… Basically, anything you can get STATA to estimate will appear in the “ereturn list” and can be outputted with outreg2.

Although it’s a bit basic, I hope you found this little geek out useful.

Please share with us some of your favorite STATA tips and tricks either in the comments, or perhaps as a guest blogger.

Wall Street’s “American Spring”

A recent blog article here and news articles here and here report on the ongoing protests that have occupied Zucotti Park in lower Manhattan’s Wall Street financial district. Modelled on and inspired by the recent events of the “Arab Spring,” this organized anti-capitalist protest aims to occupy and shut down Wall Street, in protest over the excessive greed of Wall Street and its involvement in US and world politics. Some speaking for the protesters say the goal is to mount a permanent protest and highlight the failure of big business and the government to propose serious solutions to the problems the country (and the world) are facing in the current economic crisis.

While it is too early to fully understand this leaderless and seemingly amorphous social movement, some initial observations indicate the ways in which it is similar and dissimilar to the Arab revolts in Tunisia and Egypt that inspired it. First, the Wall Street protests have been aided even more by social media technology than their Arab counterparts, and have drawn in some people from around the country and not only New York City.  Second, the common denominator among the protesters seems to be that they are young, “over-educated, under-employed, and angry” as the British newspaper The Guardian put it here. This is not so clearly the case in the Arab protests, which enjoy a much broader representation in society. To a certain extent then, we can say that these Wall Street protesters are middle class (read bourgeois) protesters with a curiously leftist and anti-capitalist message, and until now at least they have not yet been able to draw in the much broader participation of lower socio-economic classes, although they may have their sympathy. At first glance, this might lead us to question Marxist assumptions about whether intellectuals and activists generally act in the interest of their class. However, a closer look at some of the signs and slogans held aloft by the protesters may indicate that there are many sub-segments of the bourgeoisie who aren’t being well-served by the current economic system. The fact that students loans have recently surpassed credit-card debt as the largest source of debt in American society, at a time when unemployment is at its highest level in years means there is a “critical mass” of disaffected youth who no longer believe current social, political and economic arrangements hold promise. However, a couple of “critical” questions are: “How critical?” and “Critical for what?”

Whatever the outcomes of these Wall Street protests and their occupation of Zucotti Park, renamed “Liberty Park” in homage to Egypt’s Tahrir (Liberty) Square, this should be interesting fodder for Sociologists interested in social movements particularly in the context of globalization and social media’s rapid spread of ideas.

The tables are turned: Chinese foreign economic policy in comparative perspective

A recent blog article by Nasos Mihalakas, entitled China’s Efforts to Internationalize its Currency resonated strongly with the recent course readings I have been doing for Mounira Maya Charrad’s course on Comparative Historical Sociology. In this class we have been working our way through Barrington Moore’s magnum opus, Social Origins of Dictatorship and Democracy: Lord and Peasant in the Making of the Modern World.

Moore’s central argument builds on a Marxist class analysis framework, arguing however that there are no foregone conclusions when it comes to the origins of political regimes and that the particular material economic and political conditions of a country at a crucial historical moment will largely determine whether the outcome is something akin to democracy, fascism or communism.

Although the foreign meddling of Europe in China’s politics and economy is not a central feature of the more domestically oriented argument in his chapter on China, he does remind us of how quickly the Chinese Imperial system crumbled in just a century and how the margin for manoeuvre of the rulers at that time was severely restrained by onerous treaties imposed by the British, which basically placed Chinese foreign economic and trade policy in the hands of foreign capitalists.

The current situation is somewhat ironic then, in that China has full control over their economic and fiscal policy and Europe and the United States have been complaining for some time about its policies to keep China’s currency (the “Renminbi”) artificially low by pegging it to the dollar and tightly controlling currency circulation within China.

While this has served China well for export-led growth, it seems China’s government believes this reliance on the dollar has led to some of the recent instabilities in the world economy, and we may be entering a new era in which Chinese currency is becoming increasingly internationalized as a possible counter-balance or alternative currency to the dollar.

This has major implications for the world economy in the current globalized context, but its dimensions could not possibly be fully understood without some reference to the historical context of the past two centuries and how China’s economic and trade policy was run at that time, something which Barrington Moore makes abundantly clear.

This short post isn’t a sufficient space to flesh out these ideas further, but this brief excerpt below gives an idea of the main thrust of Mihalakas’ article, and I thoroughly recommend a look at Barrington Moore’s Chapter IV: “The Decay of Imperialist China and the Origins of the Communist Variant.”

The Chinese government pegs the RMB to the dollar so the powerful and wealthy export sector can continue selling in Europe and America (and thus employment stays high).  The government also maintains strict capital controls in order to prevent inflation from hurting the vast lower and middle class.  China’s currency has become a modern-day opium, and the authorities have been searching for a way out of their current economic model which relies on growth from an undervalued currency and capital controls.  Internationalizing the RMB offers one such exit.

Eventually, wider use of the RMB outside China could redefine the balance of power in global currency markets, as the rest of the world begins trading more RMB-based assets and settling its bills with China in RMB instead of the U.S. dollar.  Beijing gets to keep its currency system, while gaining economic leverage and diplomatic legitimacy around the world.

See the link to Mihalakas’ article above.

Of course, more recent scholarship is also necessary to bridge the gap between the pre-communist and the current hyper-capitalist phases of China’s history. Still, the historical comparison seems an interesting one.