{"id":547,"date":"2025-07-28T13:22:16","date_gmt":"2025-07-28T17:22:16","guid":{"rendered":"https:\/\/literaciadigital.ufms.br\/?page_id=547"},"modified":"2025-10-06T22:02:41","modified_gmt":"2025-10-07T02:02:41","slug":"8-3","status":"publish","type":"page","link":"https:\/\/literaciadigital.ufms.br\/en\/data8\/8-0\/8-3\/","title":{"rendered":"Chapter 8.3"},"content":{"rendered":"<div style=\"position: relative\">\n<div style=\"overflow: hidden\">\n<pre><code><span style=\"color: black\">from datascience import *\r\npath_data = '..\/..\/..\/assets\/data\/'\r\nimport numpy as np\r\n\r\nimport matplotlib\r\nmatplotlib.use('Agg')\r\n%matplotlib inline\r\nimport matplotlib.pyplot as plots\r\nplots.style.use('fivethirtyeight')<\/span><\/code><\/pre>\n<h1 id=\"classifica\u00e7\u00e3o-cruzada-por-mais-de-uma-vari\u00e1vel\" style=\"text-align: center\">Cross-Classifying by More than One Variable<\/h1>\n<p style=\"text-align: justify\">When individuals have multiple features, there are many different ways to classify them. For example, if we have a population of college students for each of whom we have recorded a major and the number of years in college, then the students could be classified by major, or by year, or by a combination of major and year.<\/p>\n<p style=\"text-align: justify\">The <code>group<\/code> method also allows us to classify individuals according to multiple variables. 
This is called <em>cross-classifying<\/em>.<\/p>\n<h2>Two Variables: Counting the Number in Each Paired Category<\/h2>\n<p style=\"text-align: justify\">The table <code>more_cones<\/code> records the flavor, color, and price of six ice cream cones.<\/p>\n<pre><code><span style=\"color: black\">more_cones = Table().with_columns(\r\n    'Flavor', make_array('strawberry', 'chocolate', 'chocolate', 'strawberry', 'chocolate', 'bubblegum'),\r\n    'Color', make_array('pink', 'light brown', 'dark brown', 'pink', 'dark brown', 'pink'),\r\n    'Price', make_array(3.55, 4.75, 5.25, 5.25, 5.25, 4.75)\r\n)\r\n\r\nmore_cones<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">We know how to use <code>group<\/code> to count the number of cones of each flavor:<\/p>\n<pre><code><span style=\"color: black\">more_cones.group('Flavor')<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">But now each cone has a color as well. To classify the cones by both flavor and color, we will pass a list of labels as an argument to <code>group<\/code>. The resulting table has one row for every <em>unique combination<\/em> of values that appear together in the grouped columns. 
As before, a single argument (a list, in this case, but an array would work too) gives row counts.<\/p>\n<p style=\"text-align: justify\">Although there are six cones, there are only four unique combinations of flavor and color. Two of the cones were dark brown chocolate, and two were pink strawberry.<\/p>\n<pre><code><span style=\"color: black\">more_cones.group(['Flavor', 'Color'])<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2>Two Variables: Finding a Characteristic of Each Paired Category<\/h2>\n<p style=\"text-align: justify\">A second argument aggregates all other columns that are not in the list of grouped columns.<\/p>\n<pre><code><span style=\"color: black\">more_cones.group(['Flavor', 'Color'], sum)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price sum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.5<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\"><strong>Three or More Variables.<\/strong> You can use <code>group<\/code> to classify rows by three or more categorical variables. Just include them all in the single list that is passed as the first argument. 
But cross-classifying by several variables can become complex, as the number of distinct combinations of categories can be quite large.<\/p>\n<h2>Pivot Tables: Rearranging the Output of <code>group<\/code><\/h2>\n<p style=\"text-align: justify\">Many uses of cross-classification involve just two categorical variables, like <code>Flavor<\/code> and <code>Color<\/code> in the example above. In these cases it is possible to display the results of the classification in a different kind of table, called a <em>pivot table<\/em>. Pivot tables, also known as <em>contingency tables<\/em>, make it easier to work with data that have been classified according to two variables.<\/p>\n<p style=\"text-align: justify\">Recall the use of <code>group<\/code> to count the number of cones in each paired category of flavor and color:<\/p>\n<pre><code><span style=\"color: black\">more_cones.group(['Flavor', 'Color'])<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">The same data can be displayed differently using the Table method <code>pivot<\/code>. Ignore the code for a moment, and just examine the table of outcomes.<\/p>\n<pre><code><span style=\"color: black\">more_cones.pivot('Flavor', 'Color')<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">bubblegum<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">chocolate<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">strawberry<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Notice how this table displays all nine possible pairs of flavor and color, including pairs like &#8220;dark brown bubblegum&#8221; that do not exist in our data. Notice also that the count in each pair appears in the body of the table: to find the number of light brown chocolate cones, run your eye along the row <code>light brown<\/code> until it meets the column <code>chocolate<\/code>.<\/p>\n<p style=\"text-align: justify\">The <code>group<\/code> method takes a list of two labels because it is flexible: it could take one, three, or more. On the other hand, <code>pivot<\/code> always takes two column labels, one to determine the columns and one to determine the rows.<\/p>\n<p style=\"text-align: justify\"><strong><code>pivot<\/code><\/strong><\/p>\n<p style=\"text-align: justify\">The <code>pivot<\/code> method is closely related to the <code>group<\/code> method: it groups together rows that share a combination of values. It differs from <code>group<\/code> because it organizes the resulting values in a grid. The first argument to <code>pivot<\/code> is the label of a column that contains the values that will be used to form new columns in the result. The second argument is the label of a column used for the rows. The result gives the count of all rows of the original table that share the combination of column and row values.<\/p>\n<p style=\"text-align: justify\">Like <code>group<\/code>, <code>pivot<\/code> can be used with additional arguments to find characteristics of each paired category. 
A third optional argument called <code>values<\/code> indicates a column of values that will replace the counts in each cell of the grid. However, these values will not all be displayed; the fourth argument, <code>collect<\/code>, indicates how to collect them into a single aggregated value to be displayed in the cell.<\/p>\n<p style=\"text-align: justify\">An example will help clarify this. Here is <code>pivot<\/code> being used to find the total price of the cones in each cell.<\/p>\n<pre><code><span style=\"color: black\">more_cones.pivot('Flavor', 'Color', values='Price', collect=sum)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">bubblegum<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">chocolate<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">strawberry<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0.0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.5<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0.0<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0.0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0.0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">0.0<\/td>\n<td style=\"padding: 4px 8px;border: 1px 
solid #ddd\">8.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">And here is <code>group<\/code> doing the same thing.<\/p>\n<pre><code><span style=\"color: black\">more_cones.group(['Flavor', 'Color'], sum)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Color<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price sum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">bubblegum<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">dark brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.5<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">light brown<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">pink<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Though the numbers in both tables are the same, the table produced by <code>pivot<\/code> is easier to read and lends itself more easily to analysis. 
The advantage of <code>pivot<\/code> is that it places grouped values into adjacent columns, so that they can be combined and compared.<\/p>\n<h2>Example: Education and Income of Californian Adults<\/h2>\n<p style=\"text-align: justify\">The State of California&#8217;s Open Data Portal is a rich source of information about the lives of Californians. It is our source of a <a href=\"http:\/\/data.ca.gov\/dataset\/ca-educational-attainment-personal-income\">dataset<\/a> on educational attainment and personal income among Californians over the years 2008 to 2014. The data are derived from the U.S. Census Current Population Survey.<\/p>\n<p style=\"text-align: justify\">For each year, the table records the <code>Population Count<\/code> of Californians in many different combinations of age, gender, educational attainment, and personal income. We will study only the data for the year 2014.<\/p>\n<pre><code><span style=\"color: black\">full_table = Table.read_table(path_data + 'educ_inc.csv')\r\nca_2014 = full_table.where('Year', are.equal_to('1\/1\/14 0:00')).where('Age', are.not_equal_to('00 to 17'))\r\nca_2014<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Year<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Age<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Gender<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Educational Attainment<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Personal Income<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Population Count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 
8px;border: 1px solid #ddd\">18 to 64<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2058<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Male<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2153<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4666<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">High school or equivalent<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">7122<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid 
#ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">7261<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Male<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8569<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">18 to 64<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">14635<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Male<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15212<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Male<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">College, less than 
4-yr degree<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">B: 5,000 to 9,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15423<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1\/1\/14 0:00<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65 to 80+<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Female<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Bachelor&#8217;s degree or higher<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">A: 0 to 4,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15459<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Each row of the table corresponds to a combination of age, gender, educational attainment, and income. There are 127 such combinations in all!<\/p>\n<p style=\"text-align: justify\">As a first step, it is a good idea to start with just one or two variables. 
We will focus on just one pair: educational attainment and personal income.<\/p>\n<pre><code><span style=\"color: black\">educ_inc = ca_2014.select('Educational Attainment', 'Personal Income', 'Population Count')\r\neduc_inc<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Educational Attainment<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Personal Income<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Population Count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2058<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2153<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4666<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">High school or equivalent<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">7122<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid 
#ddd\">7261<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8569<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">14635<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15212<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">College, less than 4-yr degree<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">B: 5,000 to 9,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15423<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Bachelor&#8217;s degree or higher<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">A: 0 to 4,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15459<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Let&#8217;s start by looking at educational attainment alone. The categories of this variable have been subdivided by the different levels of income. 
So we will group the table by <code>Educational Attainment<\/code> and <code>sum<\/code> the <code>Population Count<\/code> in each category.<\/p>\n<pre><code><span style=\"color: black\">education = educ_inc.select('Educational Attainment', 'Population Count')\r\neduc_totals = education.group('Educational Attainment', sum)\r\neduc_totals<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Educational Attainment<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Population Count sum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Bachelor&#8217;s degree or higher<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8525698<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">College, less than 4-yr degree<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">7775497<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">High school or equivalent<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6294141<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4258277<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">There are only four categories of educational attainment. The counts are so large that it is more helpful to look at percents. For this, we will use the function <code>percents<\/code> that we defined in an earlier section. 
It converts an array of numbers to an array of percents out of the total in the input array.<\/p>\n<pre><code><span style=\"color: black\">def percents(array_x):\r\n    return np.round( (array_x\/sum(array_x))*100, 2)<\/span><\/code><\/pre>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">We now have the distribution of educational attainment among adult Californians. More than 30% have a Bachelor&#8217;s degree or higher, while almost 16% lack a high school diploma.<\/p>\n<pre><code><span style=\"color: black\">educ_distribution = educ_totals.with_column(\r\n    'Population Percent', percents(educ_totals.column(1))\r\n)\r\neduc_distribution<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Educational Attainment<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Population Count sum<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Population Percent<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Bachelor&#8217;s degree or higher<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8525698<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">31.75<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">College, less than 4-yr degree<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">7775497<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">28.96<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">High school or equivalent<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6294141<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">23.44<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 
8px;border: 1px solid #ddd\">No high school diploma<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4258277<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15.86<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Using <code>pivot<\/code>, we can get a contingency table (a table of counts) of adult Californians cross-classified by <code>Educational Attainment<\/code> and <code>Personal Income<\/code>.<\/p>\n<pre><code><span style=\"color: black\">totals = educ_inc.pivot('Educational Attainment', 'Personal Income', values='Population Count', collect=sum)\r\ntotals<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Personal Income<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Bachelor&#8217;s degree or higher<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">College, less than 4-yr degree<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">High school or equivalent<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">No high school diploma<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">A: 0 to 4,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">575491<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">985011<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1161873<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1204529<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">B: 5,000 to 9,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">326020<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">810641<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid 
#ddd\">626499<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">597039<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C: 10,000 to 14,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">452449<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">798596<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">692661<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">664607<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">D: 15,000 to 24,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">773684<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1345257<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1252377<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">875498<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">E: 25,000 to 34,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">693884<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1091642<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">929218<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">464564<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1122791<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1112421<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">782804<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">260579<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1594681<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">883826<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">525517<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid 
#ddd\">132516<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2986698<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">748103<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">323192<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">58945<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Here you see the power of <code>pivot<\/code> over other cross-classification methods. Each column of counts is a distribution of personal income at a specific level of educational attainment. Converting the counts to percents allows us to compare the four distributions.<\/p>\n<pre><code><span style=\"color: black\">distributions = totals.select(0).with_columns(\r\n    \"Bachelor's degree or higher\", percents(totals.column(1)),\r\n    'College, less than 4-yr degree', percents(totals.column(2)),\r\n    'High school or equivalent', percents(totals.column(3)),\r\n    'No high school diploma', percents(totals.column(4))\r\n    )\r\n\r\ndistributions<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Personal Income<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Bachelor&#8217;s degree or higher<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">College, less than 4-yr degree<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">High school or equivalent<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">No high school diploma<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">A: 0 to 4,999<\/td>\n<td style=\"padding: 
4px 8px;border: 1px solid #ddd\">6.75<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">12.67<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">18.46<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">28.29<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">B: 5,000 to 9,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.82<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.43<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">9.95<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">14.02<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C: 10,000 to 14,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.31<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.27<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">11.0<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">15.61<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">D: 15,000 to 24,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">9.07<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">17.3<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">19.9<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">20.56<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">E: 25,000 to 34,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.14<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">14.04<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">14.76<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">10.91<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">F: 35,000 to 49,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">13.17<\/td>\n<td style=\"padding: 4px 8px;border: 1px 
solid #ddd\">14.31<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">12.44<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.12<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">G: 50,000 to 74,999<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">18.7<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">11.37<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.35<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.11<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">H: 75,000 and over<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">35.03<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">9.62<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.13<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1.38<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">At a glance, you can see that over 35% of those with a Bachelor&#8217;s degree or higher had incomes of $75,000 and over, whereas fewer than 10% of the people in each of the other education categories had that level of income.<\/p>\n<p style=\"text-align: justify\">The bar chart below compares the personal income distributions of adult Californians who have no high school diploma with those who have completed a Bachelor&#8217;s degree or higher. The difference in the distributions is striking. 
There is a clear positive association between educational attainment and personal income.<\/p>\n<p>&nbsp;<\/p>\n<pre><code><span style=\"color: black\">distributions.select(0, 1, 4).barh(0)<\/span><\/code><\/pre>\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone wp-image-552\" src=\"https:\/\/literaciadigital.ufms.br\/files\/2025\/07\/8-3-1.png\" alt=\"\" width=\"748\" height=\"244\" srcset=\"https:\/\/literaciadigital.ufms.br\/files\/2025\/07\/8-3-1.png 813w, https:\/\/literaciadigital.ufms.br\/files\/2025\/07\/8-3-1-300x98.png 300w, https:\/\/literaciadigital.ufms.br\/files\/2025\/07\/8-3-1-768x250.png 768w, https:\/\/literaciadigital.ufms.br\/files\/2025\/07\/8-3-1-730x238.png 730w\" sizes=\"(max-width: 748px) 100vw, 748px\" \/><\/p>\n<p><!--###########################################################################################################################################################--><\/p>\n<table width=\"100%\">\n<tbody>\n<tr>\n<td align=\"left\"><a class=\"next-page-link\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-2\/\">\u2190 Chapter 8.2 &#8211; Classifying by One Variable<\/a><\/td>\n<td align=\"right\"><a class=\"next-page-link\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-4\/\">Chapter 8.4 &#8211; Joining Tables by Columns \u2192<\/a><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><!--###########################################################################################################################################################--><\/p>\n<\/div>\n<\/div>\n<div style=\"clear: both;height: 1px;margin-top: -1px\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>\u00cdndice 1. O que \u00e9 Ci\u00eancia de Dados? 1.1. Introdu\u00e7\u00e3o 1.1.1. Ferramentas Computacionais 1.1.2. T\u00e9cnicas Estat\u00edsticas 1.2. Por que Ci\u00eancia de Dados? 1.3. Tra\u00e7ando os Cl\u00e1ssicos 1.3.1. Personagens Liter\u00e1rios 1.3.2. Outro Tipo de Personagem 2. 
Causalidade e Experimentos 2.1. John Snow e a Bomba da Broad Street 2.2. O &#8220;Grande Experimento&#8221; de Snow 2.3. Estabelecendo [&hellip;]<\/p>\n","protected":false},"author":21894,"featured_media":0,"parent":484,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width.php","meta":{"footnotes":""},"coauthors":[14],"class_list":["post-547","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/547","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/users\/21894"}],"replies":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/comments?post=547"}],"version-history":[{"count":12,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/547\/revisions"}],"predecessor-version":[{"id":979,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/547\/revisions\/979"}],"up":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/484"}],"wp:attachment":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/media?parent=547"}],"wp:term":[{"taxonomy":"author","embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/coauthors?post=547"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}