Exploratory Data Analysis (EDA): Difference between revisions

From InfoVis:Wiki

Revision as of 11:08, 12 September 2005

Exploratory data analysis (EDA) was introduced by John Tukey as an approach to analyzing data when little is known about the system that generated it or about its context. EDA lets the data itself suggest hypotheses, instead of using it only to evaluate given (a priori) hypotheses.
Exploratory data analysis, as opposed to confirmatory data analysis, is like detective work: it looks for patterns, anomalies, or new insights in general, and is usually done via graphical representations of the underlying data set.
Exploratory Data Analysis (EDA) is an approach/philosophy for data analysis that employs a variety of techniques (mostly graphical) to maximize
1. insight into a data set;
2. uncover underlying structure;
3. extract important variables;
4. detect outliers and anomalies;
5. test underlying assumptions;
6. develop parsimonious models; and
7. determine optimal factor settings.

The EDA approach is precisely that--an approach--not a set of techniques, but an attitude/philosophy about how a data analysis should be carried out.
[Filliben, 2004]


[...] is concerned primarily with explorations and description of data, not with inference. The techniques are designed to identify fundamental, conceptually meaningful patterns and relationships in data and to call attention to observations that deviate greatly from those fundamental patterns
[Smith and Prentice, 1993]


Furthermore, EDA can be used to support the selection of appropriate statistical tools as well as to provide a basis for statistical inference and further data collection.

Essential to EDA are graphical tools like box plots, stem-and-leaf plots, scatter plots, or timelines.
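
As a rough illustration of what two of these tools compute, the following minimal Python sketch derives the five numbers a box plot draws, flags points outside the conventional 1.5 × IQR fences, and groups values into a stem-and-leaf display. The function names, the sample data, the choice of the "inclusive" quartile method, and the restriction to non-negative integers are assumptions made for this example; they do not come from the cited sources, and other quartile conventions give slightly different numbers.

```python
import statistics
from collections import defaultdict


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max), the numbers a box plot draws."""
    data = sorted(values)
    # Assumption: "inclusive" quartiles; other conventions exist.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return data[0], q1, median, q3, data[-1]


def tukey_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def stem_and_leaf(values):
    """Map each stem (tens) to its sorted leaves (units).

    Assumption: values are non-negative integers.
    """
    plot = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        plot[stem].append(leaf)
    return dict(plot)


if __name__ == "__main__":
    # Made-up sample, for illustration only.
    sample = [12, 15, 17, 21, 22, 22, 25, 28, 31, 34, 95]
    print("five-number summary:", five_number_summary(sample))
    print("outliers:", tukey_outliers(sample))
    for stem, leaves in stem_and_leaf(sample).items():
        print(f"{stem:>2} | {' '.join(map(str, leaves))}")
```

Both outputs are meant to be looked at rather than tested against: the stem-and-leaf rows show the shape of the distribution, and the flagged values are candidates for closer inspection, not automatic deletions.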

References