Sequoia Hall
Home Academics Research Seminars Consulting Industrial Affiliates People
    
Computing Guide
Our Brochure
Contact Us
Sitemap
Links



Our Faculty Research Interests

Previous Our Brochure Contents Next

For a broader view, we asked each of our other faculty to describe their interests briefly. To learn more, you'll have to phone, e-mail, write or visit! We have included in this list emeriti and some faculty in other departments who have courtesy appointments in Statistics, as they all play an active role with students and in research in the Statistics department.

Theodore W. Anderson

One of my areas of interest is multivariate statistical analysis, which is the analysis of data consisting of multiple measurements on each observational unit. Macroeconomic and psychometric data usually have this nature. I have developed methodology for analyzing such data. Many of the procedures involve reducing the dimensionality of the important variability. A class of such methods arises in the multivariate analysis of variance.

Another area of my interest is the analysis of time series, data that accrue over time. Currently I am studying statistical inference for spectral distributions of stationary statistical processes and large-sample theory for autoregressive models.
top of page

Amir Dembo

My Ph.D. in Digital Signal Processing (Electrical Engineering) was followed by research on the analysis of Neural Networks. In time I became more interested in probability theory and its applications. This is what I am doing now, half-time in the statistics department and half-time in the mathematics department.

A few examples are:

  1. Assessing which seemingly rare segments of DNA (or protein) may be due to pure chance (jointly with Sam Karlin from the mathematics department).
  2. Studying relationships between uncertainty inequalities in information theory, statistics, mathematics and physics (jointly with Tom Cover and Joy Thomas from this department and IBM, respectively).
  3. With Ofer Zeitouni (Technion, Israel), writing a book on the theory of large deviations and its applications.

top of page

Jerome H. Friedman

I am interested in how to use the capabilities of modern day computers to most effectively learn from data. These capabilities include very rapid computation and sophisticated (dynamic) graphics for scientific visualization. The goal is to develop computer based tools that can extend our ability to analyze data beyond that provided by the standard repertoire of statistical techniques that were developed before the existence of readily available computing.

One such area that is receiving considerable recent attention is machine learning ("neural networks"). Here one has a system under study that responds to a set of simultaneous input signals. The response is characterized by a set of output signals. The goal is to learn the relationship between the inputs and the outputs in the most general way possible. This exercise generally has two purposes: prediction and understanding. With prediction one is given a set of input values and wishes to predict or forecast likely values of the corresponding outputs without having to actually run the system. Sometimes prediction is the only purpose. Often, however, one wishes to use the derived relationship to gain understanding of how the system works. Such knowledge is often useful in its own right, for example in science, or it may be used to help improve the characteristics of the system, as in industrial or engineering applications.

Another area that modern computing has made possible is data exploration through graphic visualization. Here the goal is to explore data to discover the unexpected. One attempts to use the human gift for pattern recognition to detect un usual patterns in the relationships among the measured quantities. This human gift is very powerful at seeing (previously) undefined effects; one can be surprised at seeing something that was not at all anticipated, leading possibly to a new discovery. Unfortunately the human gift for pattern recognition is limited to low (two to three) dimensions, whereas the data often involve many more than three simultaneously observed quantities. One must map this high dimensional data to lower dimensional representations for visualization. The purpose of research in this field is two-fold: first to discover mappings that reduce the information content as little as possible, and second, to make innovative use of computer graphics technology to in crease humanly perceivable dimensionality to be as high as possible. The first part involves research in statistical information theory, and the second research in computer graphics and scientific data visualization.
top of page

Trevor Hastie

I am an applied statistician, with a joint position in Biostatistics in the medical school. Most of my research is generated from applications with which I am involved. I tend to teach applied classes as well. For example I have taught parts of the 315,316 Ph.D first year graduate series, as well as 341, an applied multivariate class, and 315, our new Modern Applied Statistics class.

My research has focussed on function approximation and curve fitting within a variety of different applications. A common theme in my research so far has been to try and provide methodology that naturally bridges the gap between the traditional well tested linear techniques, and the newer more adventurous nonparametric frontiers.

Generalized additive models adapt nonparametric regression technology to provide more flexibility to the usual linear models used in applied areas, such as logistic and log-linear models. Principal curves and surfaces generalize linear principal components by allowing nonlinear coordinate functions. Currently I am interested in flexible methods for classification, and my research is focussed on developing richer classes of models for this task, as well as to understand better the nature of the classification problem.

Sometimes even the linear techniques are too rich, such as when the variables are sampled versions of a smooth function or image. The exponential growth in computer processing speed and storage has allowed us to routinely gather data where each observation is one or more digitized image. This has lead to a new field currently known as functional data analysis, and is filled with many interesting (open) problems, because the traditional techniques no longer work.
top of page

Iain M. Johnstone

Much of my research is concerned with optimality in estimation how well can you estimate a parameter, function or feature with a limited amount of noisy data? Tools from statistical decision theory (akin to those of game theory) and probability limit theorems for large samples play a role. Sometimes the questions are basically for fun: why is optimality of an estimator for Poisson data the same as recurrence of a birth-and-death process; what on earth does it have to do with isoperimetric inequalities from partial differential equations? Alternatively, the theory may be abstracted from general scientific methodology what's so marvelous about maximum entropy, why are wavelets so wonderful? Othertimes, there is a tough practical issue what is the real error rate of a classification rule (the obvious estimates can be very, very wrong), and how well could you know it? This last connects to my work in the Medical School, where I am involved in various projects in arteriography, urology, heart surgery clinical trials, geographic medicine and medical imaging. Talking to a doc about her problem (not yours!) can be a welcome rest from theory and vice versa!
top of page

Helena Chmura Kraemer

I am an applied statistician particularly interested in the development of statistical methodology and application in the interstice area between biological and behavioral areas in medical research. In particular, my interests concern the assessment of reliability of measures, particularly diagnoses, assessment of the effects of problems in this area, development of research strategies, designs and analytic strategies to cope with such problems. Recently this interest has expanded to modification of signal detection methods to integrate different statistical approaches to assessment of the quality and cost-effectiveness of medical tests. Underlying all these is a long-standing interest in correlational methods, and in the issue of power in research.
top of page

Tze Leung Lai

My present research areas include sequential experimentation, adaptive inference and control, stochastic optimization, time series analysis and forecasting, regression analysis of censored and truncated failure time data, design and analysis of clinical trials, probability theory, and stochastic dynamical systems. My methodological research in these areas has been motivated by and is closely related to my applied interests in engineering systems, financial economics, and medicine. I am also currently involved in several research projects in the Division of Medical Informatics, the Cancer Hyperthermia Laboratory, and the Center for AIDS Research at Stanford.
top of page

Ingram Olkin

Recently I have focused on statistical methods for combining the results of independent studies. There is now a huge body of studies that deal with specific problems. For example, there are over 750 experiments on the effect of cloud seeding, some of which may use different seeding agents, may seed in different months, and so on. In the health sciences there are many studies concerned with the effects of a particular drug or treatment. For example, is a combination of estrogen and progesterone effective in reducing osteoporosis in women, or is aspirin effective in diminishing heart attacks? In each case, different studies may use different populations, concentrations, or frequency may vary, etc. The statistical problem is to provide procedures for combining the results of these studies. The set of such procedures has been called meta-analysis, in contrast to primary or secondary analyses. My work to date has culminated in a book (joint with Larry V. Hedges) on "Statistical Methods for Meta-analysis."

An area of current interest is how to model dependencies. The theory of inde pendent measures is well-developed, but except perhaps for the normal distribution, there are many ways for modeling non-normal bivariate distributions. The mechanisms for such modeling may arise from statistical concepts, physical structures, characterizations, mixtures, etc. Thus, we might expect different models for the joint failure distributions of the engines on an airplane, than the failure distributions of organs in the body. This area is fascinating in that it permits one to study various physical phenomena.
top of page

Richard Olshen

My research involves statistics and mathematics as they apply to problems in medicine. Typically, the applications are computer intensive. They nearly always involve sample reuse methods.

Binary tree-structured methods for classification, regression, and survival analysis have been an area of special interest. These techniques entail optimally choosing sequences of yes-no questions concerning predictors. The goal is to predict the out come for a test case for which predictors are known, but the outcome is not. Applications have included assessing the prognostic significance of dose intensity in diffuse large-cell lymphoma and predicting tendencies of certain organic compounds to cause ulcers.

Similar tree-structured algorithms, in this instance for clustering vectorial pixel intensities, are used for data compression in digital radiography.

Some research has focused on panel studies, in which many subjects are followed longitudinally. Applications have included the study of free speed human walking on a level surface, and, more recently, cholesterol.
top of page

Art Owen

My main teaching interests center on Ph.D. level applied statistics. This includes the first year applied sequence, which has been recently updated and the consulting laboratory. Since course work alone is not enough, I encourage students to get involved in projects.

My current research interests are computer experiments and empirical likelihood.

Computer experimentation is a new field in which methods of statistics (exploration, prediction, interval estimation, design) are adapted to deterministic responses com puted by simulators. These may be simulating fluid flow, integrated circuits or combus tion. Statistical methods are especially useful when the input space has a high dimension and the simulations take a long time to run. The language of probability is helpful in these problems. It can be injected by modeling the functions as realizations of random processes. I'm developing methods based on some randomness in the design.

Empirical likelihood is a distribution free mode of inference based on a nonparametric likelihood ratio. Some parametric likelihoods are chosen mostly for convenience, and methods based on them may not be consistent when the model is wrong. Empirical likelihood inferences are consistent under much weaker conditions. It has sampling properties similar to those of the bootstrap, but uses continuous optimization instead of discrete resampling.
top of page

David Rogosa

My research is in the development and application of statistical methods in the behavioral and social sciences. My major publications have been in the areas of longitudinal research (e.g., measurement of change), methods for the design and analysis of behavioral observations, and causal inference from experimental and nonexperimental studies.

Another part of my statistical activities is consulting with State and Federal gov ernmental agencies. One recent example is the development of the California Accountability Index, a composite measure of the quality and progress of public schools, for the California State Department of Education.

My teaching is a mixture of undergraduate and graduate courses: For undergraduates, an introductory course in the Department, and for graduate students, service courses for the Psychology Department and the School of Education plus advanced courses in psychometrics and applied statistics (e.g. longitudinal research, evaluation methods).
top of page

Joe Romano

Statistics is concerned with making sense or inferences about the world based on limited information and uncertainties. In contrast, mathematics is exact. The goal is to prove theorems based on a well-defined set of assumptions. It is the juxtaposition of statistics and mathematics that I find intriguing and challenging. Mathematical statistics serves to precisely quantify and explain what can be learned through "experimentation," in spite of having to acknowledge our uncertainty in the process.

While my own research has been theoretically oriented, much of it has been motivated by a desire to understand practical statistical methodology to obtain techniques they may be applied safely in practice. I have been particularly interested in advancing "nonparametric" techniques that do not rely on the statistician having to invoke unverifiable assumptions. In my work, I have tried to explore the extent of applicability of bootstrap and resampling methods, as well as understanding their limitations. My recent interests have focused on extending resampling methods to problems in time series analysis.
top of page

Charles M. Stein

I work mainly on a method of using elementary probabilistic ideas for the solution of problems which are usually attacked by other methods. There are applications to combinatorics, statistics, and traditional problems in probability theory.




top of page

Paul Switzer

Statistics can play an important role in the analysis of environmental monitoring data, detection of temporal and spatial trends, modeling the effects of human ac tivity on environmental quality and climate, the articulation of environmental standards, the optimization of pollution mitigation and control strategies, and the assessment of environmental impacts. Many branches of statistical theory can be brought together to address these problems including spatial analysis, time series, extreme value theory, design of experiments, and nonlinear response modeling
top of page

Guenther Walther

Over the last few years I have been working in various areas: Estimation under shape restrictions, the interplay of statistical accuracy and computational complexity, and solar physics, where joint research with the Applied Physics Department led me into time series analysis and bootstrap and subsampling problems. Currently I am working on a problem in flow cytometry (a technique for sorting cells) jointly with the Medical School. That problem involves nonparametric mixtures, and turned out to be interesting, challenging, and also happens to reveal a beautiful theoretical aspect.
top of page


Previous Our Brochure Contents Next


Contact  | Sitemap  | Directories  | Maps  & Directions  | Giving to Stanford
Copyright 2004Stanford University. All Rights Reserved. Stanford, CA 94305, (650) 723-2300
Terms of Use Copyright Complaints