Exploratory Data Analysis (EDA)

From InfoVis:Wiki
Revision as of 05:03, 13 August 2007 by WatHdm (talk | contribs)
Jump to navigation Jump to search

occidentale (gambia) il box mostruoso corse con uato belle fighe gratis font animali lancia lybra 2.4 jtd lx la fidanzata di tutti il diritto dovere dvd a hanging rock red faction sylvain viaccess keys arabesque chat live senza scaricare il dialer singola cerca uomo western digital wd740gd hausmannite concorso di mister gay d italia red hot chili pepper s cp 35 lazio nw-hd3 network walkman s5000 accessori i segreti della grande piramide nike air max donna faccia al muro adventure in lay gel per ricostruzione le avventure di davy crockett sfondi matrix notebook borsa samsonite hotel a porto rose senza ragione yamaha ax 496 toma zdravkovic hp omnibook 500 p3 www lauritz jeans take -two kimberly concessionario lada violenza sulle donne caricabatterie ricaricabili spoof serena grandi attice nuda nebulizzatore ultrasuoni biglietti visita micro registratori olympus www patatine com behringer equalizer recensione di madame bovary kiss me sunlight sapphire - ati x850 xt 256mb 256bit vivo www ea games it origini delle olimpiadi stop free albergo male consigli per truccarsi creedence jeep cherokee td hotel alberghi rimini giostre copertine cd audio scarica programma man rochas baldan bembo l amico e fiat punto jtd hlx wyatt, sir thomas keilor un animale irragionevole go fast ivas vernici clopedia benvenuto il luogo cordless rumblepad 2 joystick karaoke perche no chat gay roma gratis climatizzatori mitsubishi il nuovo it nauthy girl prodotti per dj collu calendario loredana lecciso auto lancia thesis voli da verona a roma ef 28-135mm f 3 5-5 6 is usm hsas agordo crotone luisa particolari te quiero tanto tanto a chi m idice rex ri 800 xc formule matematica finanziaria sony tv usb fan tutti i cartoni lacie data bank 60gb abit ic7 prontoenel calma e sangu freddo schwelm broken sword 3 gold in europa alberghi lago iseo la fattoria di maiano srl buona fortuna baglioni schemi mister universo buy alprazolam www milleuna tim it citrovorum factor thermaltake tsunami va3000swa phyto hot swap hp suoneria nokia gratis electronica eos 50 e sporca p didy mario video sexy 3gp o mp4 agp8x hdtv chario pegasus manga com wva 510de difendi il tuo occlusione intestinale assorbente avellino negozi tre agriturismo pomezia wave master orditec multifunzione canon con fax la poetica di svevo vampires zombies blow job film dvd antisala aepd aeg frigorifero presidentes foto annunci incesti adattatore bluetooth epson telefoni cellulare samsung productos notables vendita case amalfi billy joey ibm intellistation m pro kit ricarica cartuccia fitness models roma calcio sito ufficiale berengo fine arts srl mature com george michael franchising telefonia tuscany lodging durex easy-on kracheh stil unic portable dvd player energy veritas canon ir 2200 convegni a milano uccelli di rovo p4 3 6 ghz agricoltura e silvicoltura - macchine e accessori lg registratore dvd con hard disk knights - i cavalieri del futuro senza mutande testi per pianoforte emily dickinson hard disk 2 5 80 gb samsung 5400rpm lc 37 sharp divafutura it studentesse camerino last minute croazia sito e nuove immagini per alexander mp3 de los angeles de charly trascendentalismo uva uva dimensioni lavastoviglia monique i d rather go blind www francesco renga katana bamboo ilgrande baboomba rete stampanti foto modelle adolescenti reflex nikon digitale d70 crack of call of duty simona regolo www playtimes it tecra m3 1 6 manzini (distretto) phone effect rex rd 185 dx fukaimori rivestimento interno per carrozzine panasonic hd ivona india movie o tette condizionatori rumorosi patchwork libri lettori mp3 hp dialogismo macchina caffe pavoni

Exploratory data analysis (EDA) was introduced by John Tukey as an approach to analyze data when there is only a low level of knowledge about its cause system as well as contextual information. EDA aims at letting the data itself influence the process of suggesting hypotheses instead of only using it to evaluate given (a priori) hypotheses.
Explorative - opposed to Confirmatory - Data Analysis is like detective work looking for patterns, analomies or in general new insights and is usually done via graphical representations of the underlying data-set.
Exploratory Data Analysis (EDA) is detective work – numerical detective work – or counting detective work – or graphical detective work ... unless exploratory data analysis uncovers indications, usually quantitative ones, there is likely to be nothing for confirmatory data analysis to consider ... [it] can never be the whole story, but nothing else can serve as the foundation stone - as the first step.
[Tukey, 1977, p. 1-3]


Exploratory Data Analysis (EDA) is an approach/philosophy for data analysis that employs a variety of techniques (mostly graphical) to maximize
1. insight into a data set;
2. uncover underlying structure;
3. extract important variables;
4. detect outliers and anomalies;
5. test underlying assumptions;
6. develop parsimonious models; and
7. determine optimal factor settings.

The EDA approach is precisely that--an approach--not a set of techniques, but an attitude/philosophy about how a data analysis should be carried out.
[Filliben, 2004]


[...] is concerned primarily with explorations and description of data, not with inference. The techniques are designed to identify fundamental, conceptually meaningful patterns and relationships in data and to call attention to observations that deviate greatly from those fundamental patterns
[Smith and Prentice, 1993]


Exploratory data analysis as the process of searching and analyzing databases to find implicit but potentially useful information, is a difficult task. At the beginning, the analyst has no hypothesis about the data. According to John Tuckey, tools as well as understanding are needed [Tukey, 1977] for the interactive and usually undirected search for structures and trends.
[Keim et al., 2006]



Furthermore, EDA can be used to support the selection of appropriate statistical tools as well as to provide a basis for statistical inference and further data collection.

Essential to EDA are graphical tools like box plots, stem