Gregory R. Warnes



Education


  Ph.D. Biostatistics University of Washington, Seattle, WA   12/00
       Dissertation Topic: The Normal Kernel Coupler: an Adaptive Markov
         Chain Monte Carlo Method for Efficiently
         Sampling from Multi-modal Distributions
       Preceptor: Adrian E. Raftery, Ph.D.
       Biology Project: Genetic and Epigenetic Changes Associated with
         Progression from Barrett's Esophagus to
         Esophageal Adenocarcinoma
       Biology Advisor: Brian J. Reid, M.D. Ph.D.
  M.Sc. Biostatistics University of Washington, Seattle, WA   12/97
  B.Sc. Statistics Brigham Young University, Provo, UT   4/95
  A.Sc. Computer Science Weber State University, Ogden, UT   6/92



Employment


  Associate Professor, Department of Biostatistics and Computational Biology, University of Rochester
     05/06 - present
      Co-director of the biocomputing component of the University of Rochester Center
      for Biodefense Immune Modeling; general statistical computing research.
      (Hulin Wu, Ph.D.)
  Owner & Chief Scientist, Random Technologies LLC
     12/06 - present
      Developed software business providing enterprise-class packaging,
      enhancements, services, and training for the open-source ``R'' statistical
      software system.
  Associate Director, Nonclinical Statistics, Biometrics and Reporting, Pfizer Global Research and Development
     11/00 - 05/06
      Supervised a team performing statistical analysis, methods development, and
      software implementation in support of genetic, genomic, proteomic, and
      metabonomic projects; computational statistics research. (Daniel R. Meyer, Ph.D.)
  Associate Research Scientist, Department of Computer Science, Yale University
     1/03 - 5/06
      Development of parallel and distributed tools for statistics; development
      of algorithms and software for the analysis of genetic, genomic, proteomic,
      and metabonomic data. (Martin H. Schultz, Ph.D.)
  Summer Intern, Statistics and Data Mining Research, Bell Labs, Lucent Technologies
     6/99 - 9/99
      Developed modules for distributed pseudo-random number generation,
      distributed bootstrapping, and distributed Markov Chain Monte Carlo
      for the next-generation statistical computing system, OMEGAHAT.
      (John M. Chambers, Ph.D.)
  Research Assistant, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center
     9/99 - 10/00, 12/97 - 6/99
      Developed methods for Markov Chain Monte Carlo on parallel computers.
      Constructed and maintained BEOWULF parallel computing cluster.
      Provided programming and Markov Chain Monte Carlo expertise to
      other project members. (Steven G. Self, Ph.D. and E. Georg Luebeck, Ph.D.)
  Research Assistant, Department of Biostatistics, University of Washington
     6/95 - 12/97
      Performed statistical analysis and programming. Maintained research
      database for a gastroenterology cancer research team. (various)


Appointments

  Member Pharmacogenomics Statistical Expert Team, Pharmaceutical Research and Manufacturers of America (PhRMA) 2/04 - 5/06
  Scout Master Troop 50, Groton CT, Connecticut Rivers Council, Boy Scouts of America 4/04 - 10/05
  Associate Editor Journal of Statistical Software 11/02 - 12/04
  Senator Graduate and Professional Student Senate, University of Washington 9/97 - 6/99
  Member Student Technology Fee Committee, University of Washington 9/97 - 10/98
      Evaluated grant proposals, provided direction and oversight for  
      $4 Million of technology funds.
  Member Computing Committee, Department of Biostatistics, University of Washington 9/95 - 6/96
      Represented student interests. Provided technical expertise for  
      policy on the purchase and administration of computer equipment.  


Honors

  PGRD Achievement Award (division-wide recognition for excellence), Pfizer Global Research and
    Development, 6/04
  NIH AIDS Training Grant, 1/98 - 11/00
  Invited Participant, NPACI Parallel Computing Institute, 8/98
  Highest Score, 1997 Ph.D. Applied Exam, University of Washington Depts. of Statistics and Biostatistics, 8/97
  Member Golden Key National Honor Society 1992 - present
  Member $\mu\sigma\rho$ National Statistics Honor Society, '93 - present
  Questar Corporate Scholarship, 12/93 - 4/94
  C. Elrod Leary Scholarship, 8/94 - 4/95
  Missionary, France Paris Mission, The Church of Jesus Christ of Latter Day Saints, 10/89 - 11/91
  Eagle Scout, Boy Scouts of America


Membership in Professional Organizations

  American Statistical Association (ASA)
  ASA Section on Statistical Computing
  ASA Section on Statistical Graphics
  ASA BioPharmaceutical Section
  ASA Connecticut Chapter


Skills

  Collaborative Teamwork:
Participate on a variety of management, technology, and project teams spanning most aspects of pharmaceutical discovery; collaborate with scientists and physicians on the analysis and interpretation of diverse data sets

  Project Management:
Successfully manage a variety of data analysis, methods development, process development, and programming projects

  Communication with Diverse Communities:
Successful communication with scientists, physicians, managers, and others from diverse disciplines, cultures, and communities

  Practice/Process Development:
Long-term interest and experience in developing standard protocols, processes, and work techniques to accelerate and enhance the quality of statistical and scientific conclusions

  Experience with -omics technologies:
Affymetrix SNP and Gene Expression Arrays, Perlegen SNP, High-Throughput RT-PCR, 2-D Gel Proteomics, Mass Spec Proteomics, MRI Metabonomics, etc.

  Data analysis tools:
R, Splus, SAS, custom code, ...

  Programming languages:
Python, PERL, Java, C, C++, Fortran, ...

  Software Protocols:
HTML/HTTP, XML, SOAP, CORBA, TCP/IP, MPI, PVM, ...

  Operating System Use and Programming APIs:
Unix (Linux, Solaris, ...), MS-Windows, Mac OS X, ...

  Spoken Languages:
Native English, Fluent French


Publications

   Kooner JS, Chambers JC, Aguilar-Salina CA, et al.
``Genome-wide scan identifies variation in MLXIPL associated with plasma triglycerides in man'', accepted, Nature Genetics.

  Burrows RB, Warnes GR, Hanumara RC
``Statistical Modeling of Biochemical Pathways'', IET Systems Biology, Volume 1, p. 353, 2007

   Mank-Seymour AR, Richmond JL, Wood LS, Reynolds JM, Fan Y, Warnes GR, Milos MP, Thompson JF
``Association of torsades de pointes with novel and known single nucleotide polymorphisms in long QT syndrome genes.'' American Heart Journal, Volume 152, Issue 6, Pages 1116-1122, August 2006

  Warnes GR
``Sample Size Estimation for Microarray Experiments using the SSIZE package'', R News, Volume 6, Issue 5, pp. 64-68, December 2006.

  Warnes GR, Jain N
``Balloonplot: A graphical tool for displaying tabular data'', R News, Volume 6, Issue 2, May 2006

  Warnes GR, Liu P.
``Sample Size Estimation for Microarray Experiments'', Technical report 06/06, Department of Biostatistics and Computational Biology, University of Rochester, 2006. (also submitted to Bioinformatics)

  Caba E, Dickinson DA, Warnes GR, Aubrecht J.
``Differentiating mechanisms of toxicity using global gene expression analysis in Saccharomyces cerevisiae,'' Special Issue on EEMS 2004, Mutation Research, Volume 575, Issues 1-2, Aug. 2005, Pages 34-46.

  Warnes GR.
``RSOAP - Using ``R'' with Python,'' PyZine, Volume 11, Issue 05, Apr. 2004.

  Dickinson DA, Warnes GR, Quievryn G, Messer J, Zhitkovich A, Rubitski E, and Aubrecht J.
``Differentiation of DNA-reactive and non-reactive genotoxic mechanisms using gene expression profile analysis'', Mutation Research, Volume 549, Issues 1-2, May 2004, Pages 29-41

  Warnes GR.
``The Genetics Package,'' R News, Volume 3, Issue 1, Jun. 2003.

  Warnes GR.
``The Gregmisc Package: Something for Everyone'' submitted to R News.

  Warnes GR.
``HYDRA: A Java library for Markov Chain Monte Carlo,'' Journal of Statistical Software, Volume 7, Issue 4, Mar. 2002.

  Warnes GR.
``The Normal Kernel Coupler: An adaptive Markov Chain Monte Carlo method for efficiently sampling from multi-modal distributions,'' submitted to Journal of the American Statistical Association, currently in revision

  Yanez ND, Warnes GR, and Kronmal RA.
``A Univariate Measurement Error Model for Longitudinal Change,'' Communications in Statistics, Volume 30, Issue 2, 2001

  Warnes GR.
``The Normal Kernel Coupler: An adaptive Markov Chain Monte Carlo method for efficiently sampling from multi-modal distributions,'' Technical Report no. 395, Department of Statistics, University of Washington, Apr. 2001.

  Warnes GR.
``HYDRA: A Java library for Markov Chain Monte Carlo,'' Technical Report no. 394, Department of Statistics, University of Washington, Apr. 2001.

  Warnes GR.
``The Normal Kernel Coupler: An adaptive Markov Chain Monte Carlo method for efficiently sampling from multi-modal distributions,'' Ph.D. thesis, Department of Biostatistics, University of Washington, Oct. 2000.

  Llyon J, Fellingham GW, Tolley HD, Harris T, Hilton G, Warnes GR.
``Mortality Differences among Adult Men in Utah 1975-79 Associated with Difference in Tobacco and Alcohol Usage'' 1995 (Unpublished)


Presentations

   Lazarus R, Henderson D, Qiu W, and Warnes GR
``Beyond Expression: Statistical genetics and integrative genomics with Rgenetics'', Bioconductor User and Developer Conference, Seattle, WA, August 6-7, 2007

   Warnes GR, Liu P
``Sample Size Estimation for Omics Experiments'', 2007 Joint Statistical Meetings (JSM 2007), Salt Lake City, UT, July 29-August 2, 2007

   Wu H, Warnes GR, Miao H, Wu C, LeBlanc A
``DEDiscover Differential Equation Modeling System'', Workshop on Statistical Methods for Modeling Dynamic Systems, Centre de Recherches Mathematiques, Universite de Montreal, Montreal, Canada, July 9-13, 2007

   Warnes GR
``Open Source Software in Pharmaceutical Research'', 2007 Drug Information Association Annual Meetings (DIA 2007), Atlanta, GA, Jun 18-23, 2007

   Warnes GR
``Open Source Software in Pharmaceutical Discovery'', 2007 Midwest Biopharmaceutical Workshop, Drug Information Association Annual Meetings (DIA 2007), Muncie, IN, May 20-22, 2007

   Warnes GR, Rogers JA, Kuhn M.
``Open Source Software in Pharmaceutical Research'', 2006 Joint Statistical Meetings (JSM 2006), Seattle, WA, Aug 6-10, 2006

   Warnes GR, Chasalow S, Montana G, O'Connell M, Henderson D, Jain N, Qiu W, Cheng J, Lazarus R
``The R Genetics Project: Bioconductor for Genetics'', Bioconductor User and Developer Conference (BioC 2006), Seattle, WA, Aug 3-4, 2006

   Warnes GR, Rogers JA, Kuhn M.
``Open Source Software in Pharmaceutical Research'', 2nd International R User Conference (UseR! 2006), Vienna, Austria, Jun 15-17, 2006

   Warnes GR, Chasalow S, Montana G, O'Connell M, Henderson D, Jain N, Qiu W, Cheng J, Lazarus R
``The R Genetics Project: Bioconductor for Genetics'', 2nd International R User Conference (UseR! 2006), Vienna, Austria, Jun 15-17, 2006

   Warnes GR, Burrows R
``Effective Simulation and Analysis of Biological Systems Through a Synergistic Combination of Deterministic Mathematical Modeling and Bayesian Statistical Techniques'', Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, Feb 17, 2006

   Warnes GR.
``OpenStatServer: Deploying Custom Statistical Computation to Scientific Clients'', Systems Personnel Activity Meeting (SPAM) talk, Yale Department of Computer Science, New Haven, CT, Jan. 27, 2005

   Warnes GR.
``Data Mining Opportunities in Pharmaceutical Research,'' Workshop on Data Mining, Fields Institute, Toronto, Canada, Nov. 11 2005

   Warnes GR.
Discussant for ``Chemometrics Applications to Systems Biology: Genomics, Proteomics, Metabonomics and Lipomics,'' The Gordon Conference on Statistics in Chemistry and Chemical Engineering, Mount Holyoke College, South Hadley, MA, Jul. 21, 2005

   Warnes GR.
``The RStatServer and Chaco Projects: Deploying Statistical Computation to Scientists'', Statistical Computing Seminar, GlaxoSmithKline, Indianapolis, IN, May 25, 2005

   Warnes GR.
``Sample Size Estimation for Microarray Experiments'', Midwest Biopharmaceutical Statistics Workshop, Indianapolis, IN, May 25, 2005

   Warnes GR.
``Double Header: Sample Size Estimation for High Resolution Biology Assays and RStatServer, an Infrastructure for Rapid Creation of Statistical Web Applications,'' Department of Statistics Seminar, Department of Statistics, Yale University, New Haven, CT, Apr. 11, 2005.

   Warnes GR.
``The R Genetics Package: Tools for Statistical Genetics,'' Invited Talk, 3rd Annual Statistics Mini-Conference, Connecticut Chapter, American Statistical Association, Mar. 5, 2005.

   Warnes GR.
``Double Header: Sample Size Estimation for High Resolution Biology Assays and RStatServer, an Infrastructure for Rapid Creation of Statistical Web Applications,'' Epidemiology and Biostatistics Seminar Series, Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, NY, Oct. 20, 2004.

   Warnes GR.
``Omics Technologies at Pfizer,'' Bioinformatics Seminar Series, Rhode Island Biomedical Research Infrastructure Network (BRIN), University of Rhode Island, Kingston, RI, Oct. 1, 2003.

   Warnes GR and Li F.
``Sample Size Selection for Microarray Based Gene Expression Studies,'' Talk, ``2003 FDA/Industry Statistics Workshop: From Theory to Regulatory Acceptance'', American Statistical Association, Bethesda, MD, Sep 18-19, 2003.

   Warnes GR.
``Omics Technologies at Pfizer,'' Seminar Series, Computational Biology and Bioinformatics Program, Yale University, New Haven, CT, Aug. 13, 2003.

   Dickinson DA and Warnes GR.
``Genotoxic Stress-Associated Gene Expression Profiles in Saccharomyces cerevisiae,'' Poster, Environmental Mutagen Society Annual Meeting, Miami, Florida, May 10 - 14, 2003.

  Warnes GR.
``RSOAP: A simple SOAP server for R,'' Talk, Third International Workshop on Distributed Statistical Computing (DSC 2003), Technische Universität Wien. Vienna, Austria, Mar. 19-22, 2003.

  Warnes GR.
``R + Zope = RStatServer,'' Talk, Third International Workshop on Distributed Statistical Computing (DSC 2003), Technische Universität Wien. Vienna, Austria, Mar. 19-22, 2003.

  Warnes GR.
``Efficient and Adaptive MCMC by Coupling Multiple Samplers,'' Seminar, Department of Statistics, University of California at Los Angeles. Los Angeles, California, Feb. 18, 2003.

  Warnes GR.
``Efficient and Adaptive MCMC by Coupling Multiple Samplers,'' Invited Talk, Joint Statistical Meetings, hosted by American Statistical Association, the International Biometric Society, the Institute of Mathematical Statistics, and the Statistical Society of Canada. New York, New York, Aug. 3-7, 2002

  Warnes GR.
``HYDRA: A Java library for Markov Chain Monte Carlo,'' Second International Workshop on Distributed Statistical Computing (DSC 2001), hosted by the Vienna University of Technology. Vienna, Austria. Mar. 22-24, 2001

  Warnes GR.
``Normal Kernel Coupling: An Efficient MCMC Method for Sampling from Multimodal Distributions,'' Seminar, Department of Biostatistics, University of Washington. Seattle, Washington. Aug., 2000.

  Warnes GR.
``ClusterNFS: Simplifying Linux Clusters,'' The Seattle SAGE Group (SSG). Seattle, Washington. May 11, 1999

  Warnes GR.
``Random Number Generation for Parallel and Threaded Programs,'' Workshop on Distributed Statistical Computing, hosted by the Vienna University of Technology. Vienna, Austria. Mar. 19-23, 1999

  Rossini, AJ and Warnes GR.
``An Introduction to CORBA for Statisticians,'' Statistical Science and the Internet, hosted by Statistics and Data Mining Research, Bell Labs. Drew University, Madison, New Jersey. Jul. 12-14, 1998


Software

  Wu H, Warnes GR, Miao H, Wu C, LeBlanc A
``DEDiscover'' a cross-platform software tool for differential equation model simulation and estimation, designed with special attention to the features necessary for modeling the interaction between the human immune system and viruses. https://cbim.urmc.rochester.edu/software/dediscover, 2007-

  Qiu W, Lazarus R, Warnes GR, Jain N
``GeneticsQC'', a package of classes and functions for the open-source statistical package ``R'' that checks the quality of a genetics data set (as a geneSet object), such as reporting the counts of missing genotypes for markers or for subjects, checking Mendelian errors, testing Hardy-Weinberg Equilibrium, etc. This package also provides functions to filter out low-quality markers, subjects, and/or families. http://r-genetics.org, 2007-

  Warnes GR, Duffy D, Man M, Qiu W, Lazarus R
``GeneticsDesign'', a package for the open-source statistical package ``R'' that provides classes and functions for designing genetics studies, including power and sample-size calculations.
http://bioconductor.org/packages/2.0/bioc/html/GeneticsDesign.html,
2007-

  Qiu W, Lazarus R, Warnes GR, Jain N
``fbat'', a package for the open-source statistical software package ``R'' that implements a broad class of Family Based Association Tests for genetic data, with adjustments for population admixture using the code from the 'FBAT' software program. http://bioconductor.org/packages/2.0/bioc/html/fbat.html, 2006-

  Warnes GR, Lazarus R, Chasalow SD, Montana G, O'Connell M, Cheng J, Jain N
``GeneticsBase'', a package for the open-source statistical package ``R'' that provides classes and functions for handling and analyzing large scale genetic data (up to 1e6 markers) http://bioconductor.org/packages/2.0/bioc/html/GeneticsBase.html, 2005-

  Warnes GR
``WardListing'', a tool for downloading LDS Ward Membership information from the official LDS web site, and translating it into formats appropriate for importing into address book software. http://www.warnes.net/Software/WardListing, 2005-

  Smith CR and Warnes GR
``Rlsf'', a package of functions for the open-source statistical package ``R'' that provides functions for using R with the LSF cluster/grid queuing system. http://cran.r-project.org/src/contrib/Descriptions/Rlsf.html, 2005-

  Warnes GR and Li F.
``ssize'', a package of functions for the open-source statistical package ``R'' that provides functions for computing and displaying sample size information for gene expression arrays. http://www.bioconductor.org/packages/bioc/1.6/src/contrib/html/ssize.html,
2004-

  Warnes GR.
``gmodels'', a package of functions for the open-source statistical package ``R'' that provides various R programming tools for model fitting http://cran.r-project.org/src/contrib/Descriptions/gplots.html, 2005-

  Warnes GR.
``gdata'', a package of functions for the open-source statistical package ``R'' that provides various R programming tools for data manipulation. http://cran.r-project.org/src/contrib/Descriptions/gplots.html, 2005-

  Warnes GR.
``gtools'', a package of functions for the open-source statistical package ``R'' that provides various general purpose programming tools.
http://cran.r-project.org/src/contrib/Descriptions/gtools.html, 2005-

  Warnes GR.
``gplots'', a package of functions for the open-source statistical package ``R'' that provides various R programming tools for plotting data. http://cran.r-project.org/src/contrib/Descriptions/gplots.html, 2005-

  Moriera W, Warnes GR.
``rpy'', a robust Python interface to the R Programming Language, http://rpy.sf.net, 2004-

  Warnes GR.
``fork'', a package of functions for the open-source statistical package ``R'' that provide simple wrappers around the Unix process management API calls: fork, wait, waitpid, kill, and _exit. This enables construction of R programs that utilize multiple concurrent processes. http://cran.us.r-project.org/src/contrib/Descriptions/fork.html, 2003-

  Warnes GR.
``fpconst'', a Python library providing constants and functions for creating and detecting the IEEE 754 floating point special values. http://www.analytics.washington.edu/statcomp/projects/rzope/fpconst/, 2003-

  Warnes GR.
``CSVFile'', objects for the open-source web application development system ``Zope'' which automatically detect and translate Microsoft Excel files into comma-delimited text files when uploaded. http://www.analytics.washington.edu/statcomp/projects/rzope/csvfile/,
2003-

  Warnes GR.
``RSessionDA'', objects for the open-source web application development system ``Zope'' which allow access to the features of the open-source statistical package/language R. http://www.analytics.washington.edu/statcomp/projects/rzope/rsessionda/, 2003-

  Warnes GR.
``session'', a package of functions for the open-source statistical package ``R'' that permit the state of the R session to be saved and restored, as well as functions for capturing the result of evaluating strings containing R commands and capturing the output. http://cran.r-project.org/src/contrib/Descriptions/session.html, 2002-

  Warnes GR.
``RSOAP'', a server providing access to the features of the open-source statistical package R via the SOAP protocol. http://research.warnes.net/projects/RStatServer/rsoap/, 2002-

  Ullman C, Matthews B, Warnes GR, Blunk C.
``SOAPpy'', a SOAP implementation for python.
http://pywebsvcs.sourceforge.net, 2002-2004

  Warnes GR and Leisch F.
``genetics'', a package for handling marker-based genetic data within the open-source statistical package ``R''. The package includes functions to compute allele frequencies, use genetic markers in statistical models, estimate disequilibrium, and test for departure from Hardy-Weinberg equilibrium. http://cran.us.r-project.org/src/contrib/PACKAGES.html#genetics,
2002-

  Warnes GR et al
``gregmisc'', a package of useful utility functions for the open-source statistical package ``R''. Most functions in the gregmisc library fall into five general areas: permutations and combinations, tools for linear models, plots, data manipulation, and fixed or extended versions of existing functions. http://cran.us.r-project.org/src/contrib/PACKAGES.html#gregmisc, 2001-

  Warnes GR.
``DistLib'', a Java library containing classes for computing features of and generating random numbers from a variety of statistical distribution functions. http://statdistlib.sourceforge.net/, 2000-

  Warnes GR.
``mcgibbsit'', a package for the open-source statistical package ``R'' implementing the MCGIBBSIT MCMC diagnostic software for multiple (potentially interrelated) MCMC samplers. http://cran.r-project.org/src/contrib/Descriptions/mcgibbsit.html, 2000-

  Warnes GR.
``HYDRA'', an open-source, platform-neutral library for performing Markov Chain Monte Carlo. It implements the logic of standard MCMC samplers within a framework designed to be easy to use and to extend while allowing integration with other software tools. http://research.warnes.net/projects/mcmc/hydra, 2000-

  Warnes GR.
``ClusterNFS'', an NFS server that allows diskless NFS clients to share a common file system by providing for host, user, and group-specific files within the same directory structure via interpreted file name extensions. http://ClusterNFS.sourceforge.net, 1999-

(Last updated December 19, 2007)


