Bulletin of the Seismological Society of America
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Bulletin of the Seismological Society of America; February 1999; v. 89; no. 1; p. 294-304
© 1999 Seismological Society of America
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Sain, S. R.
Right arrow Articles by Fisk, M. D.
Right arrow Search for Related Content
GeoRef
Right arrow GeoRef Citation

Outlier detection from a mixture distribution when training data are unlabeled

Stephan R. Sain, H. L. Gray, Wayne A. Woodward and Mark D. Fisk

Department of Statistical Science Southern Methodist University, P.O. Box 750332, Dallas, Texas 75275-0332
Mission Research Corporation, 8560 Cinderbed Road, Suite 700, Newington, Virginia 22122

Abstract

We consider the difficult task of using seismic signals (or any other discriminants) for detecting nuclear explosions from the large number of background signals such as earthquakes and mining blasts. Given a ground-truth database (i.e., labeled data), Fisk et al. (1996) consider the problem of detecting outliers (nuclear explosions) from a single background-signal population, and their approach has been applied successfully in several regions around the world. Wang et al. (1997) attack the problem in terms of modeling the background as a mixture distribution and looking for outliers (nuclear events) from that mixture. However, those authors only considered the case in which at least some fraction of the training sample was labeled, that is, at least some ground-truth information was available, and the number of distinct classes of events was known. In the current article, we extend these results to the case in which no events in the training sample are labeled and also to the case in which the number of event types represented in the training sample is unknown. One can view the mixture approach as a robust method for fitting a density to training data that may not be normally distributed whether or not the data consist of identifiable components that have a physical interpretation. The technique is demonstrated using simulated data as well as two sets of seismic data.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 1999 by the Seismological Society of America.