19 de março de 2016

Gerenciamento de Dados: codificando dados faltantes

Podemos verificar dados faltantes na coluna especie quando procuramos por valores vazios (''). No resultado gerado temos Frequency Missing = 117669. Para ignorar os dados faltantes podemos simplesmente trocar o EQ por NE no código abaixo, executado no SAS Studio.

libname mydata "<caminho do arquivo>" access=readonly;
DATA new; set mydata.lajeado;
label especie="Espécies de Peixe";
if especie EQ '';
PROC SORT; by especie;
proc freq; tables especie;
run;

The FREQ Procedure



Espécies de Peixe
especieFrequencyPercentCumulative
Frequency
Cumulative
Percent
Frequency Missing = 117669


Com o novo código que ignora os dados faltantes temos o seguinte resultado, sem apresentar a mensagem Frequency Missing:

libname mydata "<caminho do arquivo>" access=readonly;
DATA new; set mydata.lajeado;
label especie="Espécies de Peixe";
if especie NE '';
PROC SORT; by especie;
proc freq; tables especie;
run;



The FREQ Procedure




Espécies de Peixe
especieFrequencyPercentCumulative
Frequency
Cumulative
Percent
A.acutus10.0010.00
A.aguaboen160.01170.01
A.albater30.00200.01
A.albifron10.00210.01
A.argenteu3510.113720.12
A.blax1620.055340.17
A.brevis31501.0236841.19
A.diaphanu60.0036901.19
A.elachyle110.0037011.19
A.euryceph3350.1140361.30
A.falcatus590.0240951.32
A.gigas1260.0442211.36
A.goyacens6810.2249021.58
A.halecinu37721.2286742.80
A.hypselon220.0186962.80
A.hystrix290.0187252.81
A.machrisi870.0388122.84
A.maculosu10.0088132.84
A.microlep28710.93116843.77
A.minutus740.02117583.79
A.normani4280.14121863.93
A.nuchalis220197.103420511.02
A.ocellatu10.003420611.02
A.robertsi50941.643930012.66
A.stigmatu30.003930312.67
A.ternetzi380.013934112.68
A.tetramer1490.053949012.73
A.tocantin210.013951112.73
A.ucayalen4790.153999012.89
Aequidens20.003999212.89
Anchoviell1250.044011712.93
Ancistrus3510.024016812.94
Ancistrus420.004017012.94
Ancistrus5270.014019712.95
Ancistrus610.004019812.95
Apionichth10.004019912.95
Aspidoras2060.074040513.02
Auchenipte300.014043513.03
B.copei510.024048613.05
B.cupido910.034057713.08
B.cuvieri54351.754601214.83
B.falcatus1000.034611214.86
B.filament50.004611714.86
B.fowleri16540.534777115.39
B.goulding710.024784215.42
B.longipin5950.194843715.61
B.niveatus2150.074865215.68
B.pinnicau80.004866015.68
B.velox18730.605053316.28
Batrachog13830.125091616.41
Batrachog270.005092316.41
Brycon114900.485241316.89
Brycon210520.345346517.23
BryconopsB4630.155392817.38
BryconopsC360.015396417.39
C.acutiros52341.695919819.08
C.adspersa1950.065939319.14
C.anchoveo320.015942519.15
C.araguaie6260.206005119.35
C.asellus22630.736231420.08
C.atrisign780.036239220.11
C.britskii26260.856501820.95
C.callicht170.016503520.96
C.cametana420.016507720.97
C.coecutie6530.216573021.18
C.cracenti2120.076594221.25
C.cyprinoi13160.426725821.67
C.dorsalis6530.216791121.88
C.epakros910.036800221.91
C.festivus22220.727022422.63
C.figueire40.007022822.63
C.gibbus12200.397144823.02
C.goulding480.027149623.04
C.hauxwell14780.487297423.52
C.immacula7310.247370523.75
C.inornata42871.387799225.13
C.johanna10.007799325.13
C.kelberi170.017801025.14
C.labrina440.017805425.15
C.labyrint64722.098452627.24
C.lepidota1130.048463927.28
C.leticiae1070.038474627.31
C.lugubris4770.158522327.46
C.macropom130.008523627.47
C.menezesi2720.098550827.56
C.mucipu720.028558027.58
C.piquiti19510.638753128.21
C.plumbea10.008753228.21
C.plumbeus4060.138793828.34
C.punctatu90.008794728.34
C.reticula250.018797228.35
C.saxatili150.008798728.35
C.schizodo10.008798828.35
C.signatus880.038807628.38
C.spilurus11470.378922328.75
C.strigata580.028928128.77
C.tocantin640.028934528.79
C.tucurui600.028940528.81
C.xinguens250.018943028.82
Centromoch120.008944228.82
Cetopsis10.008944328.82
Characidiu6080.209005129.02
CorydorasA10.009005229.02
CorydorasB150.009006729.02
Crenicich720.009006929.03
Curimata21280.699219729.71
Curimatida60.009220329.71
E.electric170.019222029.72
E.macrops7230.239294329.95
E.paradoxu6740.229361730.17
E.trilinea2670.099388430.25
Eigenmann1110.009389530.26
F.amazona20.009389730.26
G.carapo440.019394130.27
G.gulo14230.469536430.73
G.joselima12100.399657431.12
GlanidiumA730.029664731.14
GlanidiumB100.009665731.15
Gymnorham11020.039675931.18
H.acipense4820.169724131.34
H.armatus44571.4410169832.77
H.durivent40.0010170232.77
H.efasciat40.0010170632.78
H.ericae2190.0710192532.85
H.lacerdae430.0110196832.86
H.littoral1910.0610215932.92
H.malabari12290.4010338833.32
H.marginat3030.1010369133.41
H.mentalis1840.0610387533.47
H.microlep129394.1711681437.64
H.platyrhy5400.1711735437.82
H.punctata10.0011735537.82
H.spilomma630.0211741837.84
H.spinosis30.0011742137.84
H.tatauaia540.0211747537.86
H.ternetzi1680.0511764337.91
H.unimacul120443.8812968741.79
H.unitaeni130.0012970041.80
H.wilderi27030.8713240342.67
HomodiaetA50.0013240842.67
HomodiaetB10.0013240942.67
Hyphessobr320.0113244142.68
Hypoptopom5530.1813299442.86
Hypostom1010.0013299542.86
Hypostom1190.0013300442.86
Hypostom1210.0013300542.86
Hypostomu12470.0813325242.94
Hypostomu2250.0113327742.95
Hypostomu512550.4013453243.35
Hypostomu650.0013453743.35
Hypostomu765922.1214112945.48
Hypostomu8310.0114116045.49
Hypostomu96880.2214184845.71
Iguanodect140.0014186245.72
J.apenima6850.2214254745.94
J.elassona5940.1914314146.13
J.polylepi10670.3414420846.47
Jupiaba160.0114422446.48
Knodus520.0214427646.49
L.acipense50.0014428146.50
L.affinis12720.4114555346.90
L.batesii191736.1816472653.08
L.bistriat150.0016474153.09
L.desmotes5780.1916531953.27
L.fernande1320.0416545153.32
L.filament40.0016545553.32
L.frideric18610.6016731653.92
L.galaxias3010.1016761754.02
L.granti10.0016761854.02
L.griseus3960.1316801454.14
L.maculatu10.0016801554.14
L.octomacu430.0116805854.16
L.pachyche1350.0416819354.20
L.praelong140.0016820754.21
L.taeniofa10.0016820854.21
L.tigrinus15210.4916972954.70
L.trifasci150.0016974454.70
L.vittatus840.0316982854.73
Leporinu1A3700.1217019854.85
Leporinu1B4410.1417063954.99
Leporinus22570.0817089655.07
Leporinus33400.1117123655.18
Leporinus660.0017124255.18
Leporinus830.0017124555.18
Leptobryco150.0017126055.19
Loricaria10830.3517234355.54
Loricariic9960.3217333955.86
M.dichrour140254.5218736460.38
M.dispilom260.0118739060.39
M.durivent560.0218744660.41
M.hypsauch760.0218752260.43
M.hysteros5410.1718806360.60
M.loweae134894.3520155264.95
M.paucisqu130.0020156564.95
M.platycep1270.0420169265.00
M.pyrophth850.0320177765.02
M.setiger12510.4020302865.43
M.tergimac5360.1720356465.60
M.thoracat10.0020356565.60
M.torquatu5250.1720409065.77
M.uranosco170.0120410765.77
Mastiglani20.0020410965.77
MetynnisA240.0120413365.78
MetynnisB1090.0420424265.82
MicroglanA20.0020424465.82
MicroglanB20.0020424665.82
Moenkhausi40.0020425065.82
MyleusA1150.0420436565.86
MyleusB1730.0620453865.91
MyleusC90.0020454765.92
MyleusD130.0020456065.92
MyleusE50.0020456565.92
MyloplusA10.0020456665.92
MyloplusB140.0020458065.93
N.leporhin2720.0920485266.01
O.hoppei590.0220491166.03
O.niger69512.2421186268.27
Otocinclus2400.0821210268.35
P.aiereba440.0121214668.36
P.amazonic101033.2622224971.62
P.araguaie17580.5722400772.19
P.blochii65042.1023051174.28
P.brachypo1150.0423062674.32
P.brevis1170.0423074374.36
P.calhamaz70.0023075074.36
P.cayana18680.6023261874.96
P.compress6460.2123326475.17
P.costatus2380.0823350275.25
P.cristata4030.1323390575.38
P.fasciatu2530.0823415875.46
P.flavipin10540.3423521275.80
P.fourcroi4870.1623569975.95
P.granulos16770.5423737676.50
P.grunnien11560.3723853276.87
P.guianens30.0023853576.87
P.halisodo10.0023853676.87
P.hemiolio230.0123855976.88
P.junki19630.6324052277.51
P.mesopota90.0024053177.51
P.microps35341.1424406578.65
P.natterer21830.7024624879.35
P.nemurus240.0124627279.36
P.nigrican31431.0124941580.37
P.nigrolin11630.3725057880.75
P.ornatus240.0125060280.76
P.pariolis2350.0825083780.83
P.pauciras2170.0725105480.90
P.pirinamp14580.4725251281.37
P.serratus630.0225257581.39
P.squamosi83152.6826089084.07
P.tetramer1190.0426100984.11
P.vittata600.0226106984.13
Pachyurus1940.0626126384.19
Paralorica660.0226132984.21
Pe.flavipi4360.1426176584.35
Phenacorha400.0126180584.37
Pimelodel1970.0326190284.40
Pimelodel310.0026190384.40
PimelodusB180.0126192184.40
PimelodusN550.0226197684.42
Po.orbigny10.0026197784.42
Porotergu120.0026197984.42
Porotergu230.0026198284.42
PotamotryA200.0126200284.43
PotamotryB190.0126202184.44
PotamotryC590.0226208084.46
PotamotryD10.0026208184.46
PotamotryE270.0126210884.47
PotamotryF70.0026211584.47
PotamotryG400.0126215584.48
Potamotryg770.0226223284.51
Pseudacan2110.0026224384.51
R.affinis40931.3226633685.83
R.boehlkei450.0126638185.84
R.eastward1000.0326648185.87
R.geryi3350.1126681685.98
R.itacaiun10.0026681785.98
R.lanceola70.0026682485.98
R.lapidife14210.4626824586.44
R.marmorat1090.0426835486.48
R.rostratu520.0226840686.49
R.vulpinus109913.5427939790.04
R.zygonect80.0027940590.04
Retroculus2530.0827965890.12
RhamdiaB130.0027967190.12
Rhamphispn10.0027967290.13
Rivulus110.0027968390.13
S.amazonic2380.0827992190.21
S.bondi900.0328001190.23
S.brama8830.2828089490.52
S.eigenman15500.5028244491.02
S.gracilis3280.1128277291.12
S.hilarii1210.0428289391.16
S.jurupari13830.4528427691.61
S.lima6500.2128492691.82
S.macrurus540.0228498091.84
S.maculatu1500.0528513091.88
S.marmorat90.0028513991.89
S.muelleri30.0028514291.89
S.planicep130.0028515591.89
S.rhombeus76412.4629279694.35
S.rostratu3980.1329319494.48
S.tucuruie10.0029319594.48
S.vittatus10060.3229420194.81
Schultzich170.0129421894.81
Serrasalm1570.0229427594.83
St.nattere70.0029428294.83
Sternarch210.0029428394.83
Sternarcho10.0029428494.83
T.albus59041.9030018896.74
T.argenteu6370.2130082596.94
T.auritus1100.0430093596.98
T.chalceus17500.5630268597.54
T.galeatus2760.0930296197.63
T.piresi2390.0830320097.71
T.stellatu1540.0530335497.76
T.trifurca24870.8030584198.56
Tatia1670.0230590898.58
Tatia2650.0230597398.60
Tatia31520.0530612598.65
Tatia410.0030612698.65
TetragonoA6810.2230680798.87
TetragonoB30.0030681098.87
Tometes650.0230687598.89
V.cirrhosa1700.0530704598.95
VandelliaA220.0130706798.95
VandelliaC220.0130708998.96
Z.zungaro2690.0930735899.05
camarao610.0230741999.07
caranguejo20.0030742199.07
descarte9550.3130837699.38
hibrido270.0130840399.38
jovem90.0030841299.39
juvenil18920.61310304100.00
larva110.00310315100.00
Referência:
Verifique item 4 http://mduft.wikidot.com/aula3

Nenhum comentário:

Postar um comentário