Research Interests  

Ann Lee's research interests lie in developing statistical methodology and mathematical tools for complex scientific problems that involve high-dimensional data, heterogeneous structure, and complex noise, and that go beyond regression and classification. Her recent work (in collaboration with the CANDELS and LSST Dark Energy teams) involves nonparametric estimation and comparison of multivariate posterior distributions for galaxy images and photometric color data.

Publication List  

Journal Publications

Izbicki, R., and Lee, A. B. (2017) "Converting High-Dimensional Regression to High-Dimensional Conditional Density Estimation". Electronic Journal of Statistics 11(2):2800-2831, [pdf].

Freeman, P. E., Izbicki, R., and Lee, A. B. (2017) "A Unified Framework for Constructing, Tuning and Assessing Photometric Redshift Density Estimates in a Selection Bias Setting". To appear in Monthly Notices of the Royal Astronomical Society, [arXiv:1703.09242v1].

Izbicki, R., Lee, A. B., and Freeman, P. E. (2017) "Photo-z Estimation: an Example of Nonparametric Conditional Density Estimation under Selection Bias". To appear in Annals of Applied Statistics, [arXiv:1604.01339v1].

Lee, A. B., and Izbicki, R. (2016) "A Spectral Series Approach to High-Dimensional Nonparametric Regression". Electronic Journal of Statistics, 10:423-463 [pdf].

Izbicki, R., and Lee, A. B. (2016) "Nonparametric Conditional Density Estimation in a High-Dimensional Regression Setting". Journal of Computational and Graphical Statistics, 25(4):1297-1316 [preprint][appendix].

Gaugler, T., Klei, L., Sanders, J. S., Bodea, C. A., Goldberg, A. P., Lee, A. B., Mahajan, M., Manaa, D., Pawitan, Y., Reichert, J., Ripke, S., Sandin, S., Sklar, P., Svantesson, O., Reichenberg, A., Hultman, C. M., Devlin, B., Roeder, K., and Buxbaum, J. D. (2014) "Most genetic risk for autism resides with common variation". Nature Genetics Letter, 46:881-885 [main pdf] [suppl].

Izbicki, R., Lee, A. B., and Schafer, C. M. (2014) "High-Dimensional Density Ratio Estimation with Extensions to Approximate Likelihood Computation". Journal of Machine Learning Research (AISTATS track), 420-429 [main pdf] [suppl].

Freeman, P. E., Izbicki, R., Lee, A. B., Newman, J. A., Conselice, C. J., Koekemoer, A. M., Lotz, J. M., and Mozena, M. (2013) "New Image Statistics for Detecting Disturbed Galaxy Morphologies at High Redshift". Monthly Notices of the Royal Astronomical Society 434(1):282-295 [pdf].

Crossett, A., Lee, A. B., Klei, L., Devlin, B., and Roeder, K. (2013) "Refining Genetically Inferred Relationships Using Treelet Covariance Smoothing". Annals of Applied Statistics, 7(2):669-690 [pdf].

Richards, J. W., Lee, A. B., Schafer, C. M., and Freeman, P. E. (2012) "Prototype Selection for Parameter Estimation in Complex Models". Annals of Applied Statistics, 6(1): 383-408 [pdf].

Wang, W., Ozolek, J. A., Slepcev, D., Lee, A. B., Chen, C., and Rohde, G. K. (2011) "An optimal transportation approach for nuclear structure-based pathology". IEEE Trans. Med. Imag., 30(3):621-631 [pdf].

Buchman, S. M., Lee, A. B., and Schafer, C. M. (2011) "High-Dimensional Density Estimation via SCA: An Example in the Modelling of Hurricane Tracks". Statistical Methodology, 8(1):18-30 [pdf]

Lee, A. B., and Wasserman, L. (2010) "Spectral Connectivity Analysis". Journal of the American Statistical Association, 105(491): 1241-1255 [pdf]. Supplementary material [pdf]. For longer technical report, see arXiv:0811.0121

Lee, A. B., Luca, D., and Roeder, K. (2010) "A Spectral Graph Approach to Discovering Genetic Ancestry". Annals of Applied Statistics, 4(1): 179-202. [pdf]

Lee, A. B., Luca, D., Klei, L., Devlin, B., and Roeder, K. (2010) "Discovering Genetic Ancestry Using Spectral Graph Theory". Genetic Epidemiology, 34(1):51-59. [pdf]

Richards, J. W., Freeman, P. E., Lee, A. B., and Schafer, C. M. (2009) "Accurate Parameter Estimation for Star Formation History in Galaxies using SDSS Spectra". Monthly Notices of the Royal Astronomical Society, 399: 1044-1057. [arXiv:0905.4683]

Freeman, P. E., Newman, J. A., Lee, A. B., Richards, J. W., and Schafer, C. M. (2009) "Photometric Redshift Estimation Using SCA". Monthly Notices of the Royal Astronomical Society, 398: 2012-2021. [arXiv:0906.0995]

Richards, J. W., Freeman, P. E., Lee, A. B., and Schafer, C. M. (2009) "Exploiting Low-Dimensional Structure in Astronomical Spectra". Astrophysical Journal, 691:32-42. [pdf]

Freeman, P. E., Richards, J. W., Schafer, C. M., and Lee, A. B. (2008) "Astrostatistics: The Final Frontier". Chance, vol 21, no 3, pp. 31-35. [pdf]

Lee, A. B., Nadler, B., and Wasserman, L. (2008) "Treelets -- An Adaptive Multiscale Basis for Sparse Unordered Data". The Annals of Applied Statistics, vol 2, no 2, pp. 435-471. Discussion paper. [pdf]

Lee, A. B., Nadler, B., and Wasserman, L. (2008) "Rejoinder of: Treelets". The Annals of Applied Statistics, vol 2, no 2, pp. 494-500. [pdf]

Luca, D., Ringquist, S., Klei, L., Lee, A. B., Gieger, C., Wichmann, H.-E., Schreiber, S., Krawczak, M., Liu, Y., Styche, A., Devlin, B., Roeder, K., and Trucco, M. (2008) "On the Use of General Control Samples for Genome-Wide Association Studies: Genetic Matching Highlights Causal Variants", The American Journal of Human Genetics, 82:1-11. [pdf]

Lafon, S., and Lee, A. B. (2006) "Diffusion Maps and Coarse-Graining: A Unified Framework for Dimensionality Reduction, Graph Partitioning, and Data Set Parameterization". IEEE Trans. on Pattern Analysis and Machine Intelligence 28(9): 1393-1403. [pdf]

Coifman, R. R., Lafon, S., Lee, A. B., Maggioni, M., Nadler, B., Warner, F., and Zucker, S. (2005) "Geometric Diffusions as a Tool for Harmonic Analysis and Structure Definition of Data: Diffusion Maps". Proc. Natl. Acad. Sci. 102(21):7426-7431. [pdf]

Coifman, R. R., Lafon, S., Lee, A. B., Maggioni, M., Nadler, B., Warner, F., and Zucker, S. (2005) "Geometric Diffusions as a Tool for Harmonic Analysis and Structure Definition of Data: Multiscale Methods". Proc. Natl. Acad. Sci. 102(21):7432-7437. [pdf]

Lee, A. B., Pedersen, K. S., and Mumford, D. (2003) "The Nonlinear Statistics of High-Contrast Patches in Natural Images", International Journal of Computer Vision 54 (1-2): 83-103. [pdf]

Srivastava, A., Lee, A. B., Simoncelli, E. P., and Zhu, S.-C. (2003) "On Advances in Statistical Modeling of Natural Images", Journal of Mathematical Imaging and Vision 18: 17-33. [pdf]

Lee, A. B., Mumford, D., and Huang, J. (2001) "Occlusion Models for Natural Images", International Journal of Computer Vision, 41(1/2): 35-59. [pdf]

Lee, A. B., Blais, B. S., Shouval, H., and Cooper, L. N. (2000) "Statistics of Lateral Geniculate Nucleus (LGN) Activity Determine the Segregation of ON/OFF Subfields for Simple Cells in Visual Cortex", Proc. Natl. Acad. Sci., November 7, vol. 97, no. 23, pp. 12875-12879.

Wahnstrom, G., Lee, A. B., and Stromquist, J. (1996) "Motion of 'hot' oxygen adatoms on corrugated metal surfaces". J. Chem. Phys. 105: 326-336.

Conference Proceedings  

Lee, A. B., and Freeman, P. E. (2011) "Exploiting Low-Dimensional Structure in Astronomical Spectra". In Statistical Challenges in Modern Astronomy V, Penn State University. [arXiv:1111.0911]

Lee, A. B. (2011) "Commentary on 'Data Compression Methods in Astrophysics' by Raul Jimenez". In Statistical Challenges in Modern Astronomy V, Penn State University.

Rohde, G. K., Wang, W., Slepcev, D., Lee, A. B., Chen, C., and Ozolek, J. A. (2010) "Detecting and classifying cancers from image data using optimal transportation". In 26th Southern Biomedical Engineering Conference, University of Maryland.

Lee, A. B., and Nadler, B. (2007) "Treelets -- A Tool for Dimensionality Reduction and Multi-Scale Analysis of Unordered Data". In Proc. of the Eleventh International Conference on Artificial Intelligence and Statistics (AISTATS 2007), San Juan, Puerto Rico. [pdf]

Pedersen, K. S., and Lee, A. B. (2001) "Towards a Full Probability Model of Edges in Natural Images", ECCV (1) 2002: 328-342. [pdf]

Lee, A. B., Pedersen, K. S., and Mumford, D. (2001) "The Complex Statistics of High-Contrast Patches in Natural Images", In Proc. of IEEE Workshop on Statistical and Computational Theories of Vision, ICCV 2001, Vancouver, CA, July 13. [ps]

Huang, J., Lee, A. B., and Mumford, D. (2000) "Statistics of Range Images". In Proc. IEEE Conf. On Computer Vision and Pattern Recognition, CVPR 2000, Hilton Head Island, South Carolina, June 13-15. [ps]

Lee, A. B., Blais, B. S., Shouval, H., and Cooper, L. N. (1999) "Statistics of LGN Activity Determine the Segregation of ON/OFF Subfields for Simple Cells in Cortex". In Proc. of the Eighth CNS Conference (CNS 1999), Pittsburgh, PA, July 18-22.

Lee, A. B., and Mumford, D. (1999) "An Occlusion Model Generating Scale-Invariant Images", In Proc. of IEEE Workshop on Statistical and Computational Theories of Vision, CVPR 1999, Fort Collins, CO, June 22.

Technical Reports

Wu, W., Lee, A. B., and Mumford, D. (2003) "A Hierarchical Model for Minimum Entropy Data Partitioning", TR03-6, Pattern Theory Group, Division of Applied Mathematics, Brown University. [pdf]