mailing address: 132G Baker Hall
Department of Statistics
Carnegie Mellon University
Pittsburgh, PA 15213
tel: (412) 268-2723
fax: (412) 268-7828
Selected Professional Activities:
Institute of Mathematical Statistics: President, 1998-1999; Past President, 1999-2000.
Editor of the Annals
of Applied Statistics for Social Science, Government and
Co-Founder of Journal of Privacy
and Confidentiality, 2006-.
International Encyclopedia of the
Social and Behavioral Sciences, Co-editor Section for
Statistics, published November 2001.
(For a list of authors and their entries in the encyclopedia see .http://www.iesbs.com/author.html. Click on Authors in left hand panel.)
International Society for Bayesian
Analysis, President 1996-1997; Past-President, 1998.
Chair, Selection Committee for the DeGroot Prize, 2001-2004.
Co-chair National Academy of Sciences, Report Review
Thorsten Sellin Fellow of the American Academy of
Political and Social Science, 2004.
Elected member of The National Academy of Sciences.
Elected fellow of Royal Society of Canada.
Elected fellow of the American
Academy of Arts and Sciences.
Fellow of American Association for the Advancement of Science,
American Statistical Association, Institute of Mathematical
36-149: "Freshman Seminar: Taking the U.S. Census in 2010"
Statistics 36-720: Discrete Multivariate Analysis (Fall, 2009, 1st mini)Statistics 36-835: Statistical Models and Methods for Networks (Fall 2007)
36-756 Advanced Statistics II (Fall, 2006)
Statistics 36-315 Statistical Graphics and Visualization (Spring 2006)
36-303: Sampling, Surveys, and Society (Spring, 2003)
My principal research interests lie in the development of
especially for problems involving categorical variables. Initially, I
on the general statistical theory of loglinear models for categorical
and I applied the theory to various problems that could be represented
the form of multidimensional contingency tables. More
recently, I have studied approaches appropriate for disclosure
limitation in multidimensional tables and their relationship with
results on bounds for table entries given a set of marginals (for
selected publications on this topic see Disclosure
Limitation Papers, as well as the webpage for the NISS Digital Government
Project on this topic), estimating the size of
populations (especially in
the context of census taking), and Bayesian approaches to the
analysis of contingency tables. My research on disclosure
limitation for categorical data, and on privacy and confidentiality
more generally, has led to the creation of the online journal, The Journal of Privacy and
Confidentiality, which has published its first issues.
For some interesting historical material on the model for quasi-symmetry and the work of Henri Caussinus, see Project QS,and the special issue of Annales de la Faculté des Sciences de l'Université de Toulouse Mathématique in honor of Caussinus dated 2002.
For several years now, I have also worked on the development of statistical methods for large-scale sample surveys such as those carried out by the federal government. This work (much of which has been in collaboration with Judith Tanur) has included the study of nonsampling errors, the use of surveys to adjust census results for differential undercount, cognitive aspects of the design of survey questionnaires, statistical analysis of data from longitudinal surveys, and formal parallels in the design and analysis of sample surveys and randomized experiments. My recent book with Margo Anderson, Who Counts? (which has now appeared in a revised paper back edition), chronicles the story of the the 1990 decennial census and efforts to use sample to adjust census results for differential undercount. My work on confidentiality and disclosure limitation ties both to surveys and censuses and also to categorical data analysis (again see the webpage for the NISS Digital Government Project on this topic as well as some of the selected papers below), and also addresses public concerns about privacy. For a July 2001 news story on the topic of privacy in the Pittsburgh Post-Gazette, click here.
In the analysis of data from longitudinal studies of
disability, such as the National Long Term Care Survey, a number of
authors have used novel statistical methodology based on what has come
to be known as the Grade of Membership (GoM) model. Working
with students and colleagues, I have been exploring the GoM model, its
estimation, and comparisons between it and other categorical data
models. We have also begun to look
at confidentiality issues arising in the context of the
NLTCS. Some of our work is available on a separate
GoM Model, and Confidentiality.
I have also been active in the application of
statistical methods to
legal problems and in assessing the appropriateness of statistical
testimony in actual legal cases, and I have linked my interests in
Bayesian decisionmaking to the issues of legal decisionmaking.
For information on the NAS Sackler Symposium on Forensic
Science, held November 16-18, 2005, click here.
See also the following little piece in the Pittsburgh
interest is the report of the NAS-NRC Committee to Review the
Scientific Evidence on the Polygraph,
which was released on Octber 8, 2002 (for further details see below).
My work on statistical methods for multiple-media data (including data in the form of pictures, images, video, sound, symbols, and text.) is part of a larger research effort that is taking place in Carnegie Mellon's Maching Learning Department.
Bishop, Y.M. M., Fienberg, S.E. and Holland, P.W. (1975). Discrete
Analysis: Theory and Practice. M.I.T. Press, Cambridge, MA.
edition (1977). A Citation
Classic. Reprinted, by Springer-Verlag, New York
Fienberg, S.E. (1980). The Analysis of Cross-classified
Categorical Data. 2nd Edition. M.I.T. Press, Cambridge, MA. A
Classic. Reprinted, by Springer-Verlag, New York
DeGroot, M.H., Fienberg, S.E., and Kadane, J.B., eds. (1986). Statistics
and the Law. Wiley, New York. Wiley Classics Paperback
Fienberg, S.E., ed. (1989). The Evolving Role of Statistical Assessments as Evidence in the Courts. Springer-Verlag, New York.
Fienberg, S.E., Hoaglin, D.C., Kruskal, W.H., and Tanur, J.M., eds. (1990). A Statistical Model: Frederick Mosteller's Contributions to Statistics, Science, and Public Policy. Springer-Verlag, New York.
Meyer, M.M. and Fienberg, S.E., Editors (1992). Assessing Evaluation Studies: The Case of Bilingual Strategies. Panel to Review Evaluation Studies of Bilingual Education. National Academy Press, Washington, D.C. http://www.ulib.org/webRoot/Books/National_Academy_Press_Books/bilingual_education/bili001.htm
Devlin, B., Fienberg, S.E., Resnick, D.P., and Roeder, K., eds. (1997). Intelligence, Genes, & Success: Scientist Respond to The Bell Curve. Copernicus (Springer-Verlag) New York.
For further details on this book, click: INTELLIGENCE, GENES AND SUCCESS, and for a table of contents, click here.
Anderson, M. and Fienberg, S.E. (1999). Who Counts? The Politics of Census-Taking in Contemporary America, Russell Sage Foundation , New York. Revised paperback edition (2001).
details on this book, click here: WHO COUNTS?
Foster, A.W., Mitchell, F., and Fienberg, S.E., Editors (2002). Measuring Housing Discrimination in a National Study: Report of a Workshop. National Academy Press.
To see an online
Committee to Review the Scientific Evidence on the Polygraph (2003). The Polygraph and Lie Detection. National Academy Press.
For further details on this book, click here: http://www.nap.edu/catalog/10420.html?onpi_topnews_100802
For selected news coverage of
the release of the Polygraph Report see: Click here.
For the Department of Energy's April 2003 Response to the report and
proposed policy see:
For information on the September
2003 Senate committee oversight hearing on the DoE polgraph program,
including my testimony, see:
Airoldi, Edoardo M., Blei, David M., Fienberg, Stephen E.,
Goldenberg, Anna, Xing, Eric, and Zheng, Alice (eds.) (2007) Statistical Network
Analysis: Models, Issues, and New
Directions: ICML 2006 Workshop on Statistical Network Analysis,
Pittsburgh, PA, USA, June 29, 2006, Lecture Notes in
Computer Science, Vol. 4503, Springer-Verlag.
Committee on Technical and Privacy Dimensions of Information for Terrorism Prevention and Other National Goals (2008). Protecting Individual Privacy in the Struggle Against Terrorists: A Framework for Program Assessment. National Academies Press.
Fienberg, Stephen E. , Hoaglin, David C., and Tanur, Judith M. M.,
eds. (2010). The Pleasures of
Statistics: The Autobiography of Frederick Mosteller.
Springer, New York.
Selected Recent Papers:
Fienberg, S.E. (2000). "Contingency tables and log-linear models: Basic results and new developments." Journal of the American Statistical Association,95, 643-647.
Anderson, M., Daponte, B.O., Fienberg, S.E., Kadane, J.B., Spencer, B.D., Steffey, D. (2000). "Sample-based adjustment of the 2000 census--A balanced perspective," Jurimetrics, 40 , 341-356.
Dobra, A. and Fienberg, S.E. (2000). "Bounds for cell entries in contingency tables given marginal totals and decomposable graphs," Proceedings of the National Academy of Sciences, 97, No. 22, 11885-11892. For a pdf version of this paper, click here.
Duncan, G.T., Fienberg, S.E., Krishnan, R.,Padman, R. and Roehrig, S.F. (2001). "Disclosure Limitation Methods and Information Loss for Tabular Data." In Confidentiality, Disclosure and Data Access: Theory and Practical Applications for Statistical Agencies. (P. Doyle, J. Lane, J. Theeuwes, and L. Zayatz, eds.), Elsevier, 135--166. Prepublication version available from http://www.niss.org/dg/technicalreports.html .
Goldenberg, A., Shmueli, G., Caruana, R. & Fienberg, S.E.
statistical detection of anthrax outbreaks by tracking
over-the-counter medication sales. Proceedings of the
National Academy of Sciences , 99, 5237 - 5240. For a pdf version
of this paper, click here.
Erosheva, Elena A., Fienberg, Stephen E., and Junker, Brian W. (2002). "Alternative statistical models and representations for large sparse multi-dimensional contingency tables." Ann. Fac. Sci. Toulouse Math. (6) 11, no. 4, 485-505. For a copy click here.
Erosheva, Elena A., Fienberg, Stephen E., and Lafferty, John (2004). "Mixed-membership models of scientific publications," Proceedings of the National Academy of Sciences, 97, No. 22, 11885-11892. For a pdf version of this paper, click here.
Fienberg, Stephen E. and Slavkovic, Aleksandra B. (2005).
Preserving the Confidentiality of Categorical Data Bases When
Releasing Information for Association Rules." Data Mining and Knowledge Discovery,
11, 155-180. For a pdf version of this paper, click
Fienberg, Stephen E. (2006). "When Did Baysian Inference Become
"Bayesian"?" Bayesian Analysis,
1, 1-40. For a pdf version of this paper, click
For a related annotated bibliography, see:
Fienberg, Stephen E. (2005). A “Bayesian Classics” Reading List. ISBA Bulletin, 12(3), September 2005, 9-14.
For a pdf version, click here.
Airoldi, Edoardo M., Anderson, Analise G., Fienberg, Stephen E., and
Skinner Kiron K. (2006). "Who Wrote
Ronald Reagan's Radio Addresses?,"
1, 289-320. For a pdf version,
For a related article, see: Edoardo M. Airoldi, Stephen E. Fienberg, and Kiron K. Skinner (2007). "Whose Ideas?
Whose Words? Authorship of the Ronald Reagan Radio Addresses." PS: Political Science & Politics,
Volume 40, Issue 03, (July), 501-506.
Fienberg, Stephen E. (2006). Privacy and Confidentiality in an e-Commerce World: Data Mining, Data Warehousing, Matching and Disclosure Limitation.. Statistical Science, 21, 143-154. For a pdf version, click here.Araneda, Anita, Fienberg, Stephen E. and Soto, Alvaro (2007). "A statistical approach to simultaneous mapping and localization for mobile robots," Annals of Applied Statistics, 1, No. 1, 66-84.
E. (2007). "William
Kruskal: My Scholarly and Scientific Model, " 22, No. 2, 266–268.
For other comments by N.M. Bradburn, M.L. Eaton, L.A. Goodman, M.E. Martin, S.M. Stigler, and J.M. Tanur, see
Fienberg, Stephen E. (2007). "Editorial --
Expanding the statistical toolkit with algebraic statistics."
Statistica Sinica, 17(4), 1261-1272.
Erosheva, Elena A., Fienberg, Stephen E., and Joutard, Cyrille
(2007). "Describing disability through individual-level mixture models
for multivariate binary data," Annals
of Applied Statistics, 1, No. 2, 502-537.
Fienberg, Stephen E. and Kim, Sung-Ho (2007). "Positive association
among three binary variables and cross-product ratios," Biometrika, 94,
Fienberg, Stephen E. (2008). "The Early Statistical Years:
1947–1967. A conversation with Howard Raiffa," Statistical Science, 23, No. 1,
Jackson, L. Fraser, Gray, Alistair G., and Fienberg, Stephen
E. (2008). "Sequential category aggregation and partitioning
approaches for multi-way contingency tables based on survey and census
data," Annals of Applied Statistics,
2, No. 3, 955-981.
Airoldi, Edoardo M., Blei, David M., Fienberg, Stephen E., and Xing,
Eric P. (2008). "Mixed membership stochastic blockmodels," Journal of Machine Learning Research,
Last updated: 1/11/10.