{"id":542,"date":"2025-07-28T12:58:19","date_gmt":"2025-07-28T16:58:19","guid":{"rendered":"https:\/\/literaciadigital.ufms.br\/?page_id=542"},"modified":"2025-10-06T20:28:14","modified_gmt":"2025-10-07T00:28:14","slug":"8-2","status":"publish","type":"page","link":"https:\/\/literaciadigital.ufms.br\/en\/data8\/8-0\/8-2\/","title":{"rendered":"Cap\u00edtulo 8.2"},"content":{"rendered":"<div style=\"position: relative\">\n<div style=\"float: left;width: 300px;background-color: #f5f5f5;border: 1px solid #ddd;border-radius: 5px;padding: 15px;margin-right: 20px;margin-bottom: 5px;overflow: hidden\">\n<h3 style=\"margin: 0 0 10px 0;padding-bottom: 8px;border-bottom: 1px solid #ddd\">\u00cdndice<\/h3>\n<ol style=\"margin: 0;padding-left: 0;list-style-type: none\">\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/\">1. O que \u00e9 Ci\u00eancia de Dados?<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-1\/\">1.1. Introdu\u00e7\u00e3o<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-1\/1-1\/\">1.1.1. Ferramentas Computacionais<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-1\/1-2\/\">1.1.2. T\u00e9cnicas Estat\u00edsticas<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-2\/\">1.2. Por que Ci\u00eancia de Dados?<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-3\/\">1.3. Tra\u00e7ando os Cl\u00e1ssicos<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-3\/3-1\/\">1.3.1. Personagens Liter\u00e1rios<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/1-0\/1-3\/3-2\/\">1.3.2. Outro Tipo de Personagem<\/a><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/\">2. Causalidade e Experimentos<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/2-1\/\">2.1. John Snow e a Bomba da Broad Street<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/2-2\/\">2.2. O &#8220;Grande Experimento&#8221; de Snow<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/2-3\/\">2.3. Estabelecendo Causalidade<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/2-4\/\">2.4. Randomiza\u00e7\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/2-0\/2-5\/\">2.5. Notas Finais<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/\">3. Progamando em Python<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/3-1\/\">3.1. Express\u00f5es<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/3-2\/\">3.2. Nomes<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/3-2\/2-1\/\">3.2.1. Exemplo: Taxas de Crescimento<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/3-3\/\">3.3. Chamadas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/3-0\/3-4\/\">3.4. Introdu\u00e7\u00e3o \u00e0s Tabelas<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/4-0\/\">4. Tipos de Dados<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/4-0\/4-1\/\">4.1. N\u00fameros<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/4-0\/4-2\/\">4.2. Strings<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/4-0\/4-2\/2-1\/\">4.2.1. M\u00e9todos de Strings<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/4-0\/4-3\/\">4.3. Compara\u00e7\u00f5es<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/5-0\/\">5. Sequ\u00eancias<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/5-0\/5-1\/\">5.1. Arrays<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/5-0\/5-2\/\">5.2. Ranges<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/5-0\/5-3\/\">5.3. Mais sobre Arrays<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/6-0\/\">6. Tabelas<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/6-0\/6-1\/\">6.1. Ordenando Linhas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/6-0\/6-2\/\">6.2. Selecionando Linhas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/6-0\/6-3\/\">6.3. Exemplo: Tend\u00eancias Populacionais<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/6-0\/6-4\/\">6.4. Examplo: Propor\u00e7\u00f5es de Sexos<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/7-0\/\">7. Visualiza\u00e7\u00e3o<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/7-0\/7-1\/\">7.1. Visualizando Distribui\u00e7\u00f5es<br \/>\nCateg\u00f3ricas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/7-0\/7-2\/\">7.2. Visualizando Distribui\u00e7\u00f5es Num\u00e9ricas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/7-0\/7-3\/\">7.3. Gr\u00e1ficos Sobrepostos<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/\">8. Fun\u00e7\u00f5es e Tabelas<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-1\/\">8.1. Aplicando Fun\u00e7\u00e3o a uma Coluna<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-2\/\">8.2. Classificando por uma Vari\u00e1vel<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-3\/\">8.3. Classifica\u00e7\u00e3o Cruzada<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-4\/\">8.4. Unindo Tabelas por Colunas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-5\/\">8.5. Compartilhamento de Bicicletas<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/\">9. Aleatoriedade<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/9-1\/\">9.1. Declara\u00e7\u00f5es Condicionais<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/9-2\/\">9.2. Itera\u00e7\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/9-3\/\">9.3. Simula\u00e7\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/9-4\/\">9.4. O Problema de Monty Hall<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/9-0\/9-5\/\">9.5. Encontrando Probabilidades<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/10-0\/\">10. Amostragem e Distribui\u00e7\u00f5es Emp\u00edricas<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/10-0\/10-1\/\">10.1. Distribui\u00e7\u00f5es Emp\u00edricas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/10-0\/10-2\/\">10.2. Amostragem de uma Popula\u00e7\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/10-0\/10-3\/\">10.3. Distribui\u00e7\u00e3o Emp\u00edrica de uma<br \/>\nEstat\u00edstica<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/10-0\/10-4\/\">10.4. Amostragem Aleat\u00f3ria em Python <\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/11-0\/\">11. Testando Hip\u00f3teses<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/11-0\/11-1\/\">11.1. Avaliando um Modelo<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/11-0\/11-2\/\">11.2. M\u00faltiplas Categorias<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/11-0\/11-3\/\">11.3. Decis\u00f5es e Incertezas<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/11-0\/11-4\/\">11.4. Probabilidades de Erro<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/12-0\/\">12. Comparando Duas Amostras<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/12-0\/12-1\/\">12.1. Teste A\/B<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/12-0\/12-2\/\">12.2. Causalidade<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/12-0\/12-3\/\">12.3. Esvaziar<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/13-0\/\">13. Estima\u00e7\u00e3o<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/13-0\/13-1\/\">13.1. Percentis<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/13-0\/13-2\/\">13.2. O Bootstrap<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/13-0\/13-3\/\">13.3. Intervalos de Confian\u00e7a<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/13-0\/13-4\/\">13.4. Usando Intervalos de Confian\u00e7a<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/\">14. Por que a M\u00e9dia \u00e9 Importante<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-1\/\">14.1. Propriedades da M\u00e9dia<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-2\/\">14.2. Variabilidade<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-3\/\">14.3. O DP e a Curva Normal<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-4\/\">14.4. Teorema Central do Limite<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-5\/\">14.5. Variabilidade da M\u00e9dia da Amostra<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/14-0\/14-6\/\">14.6. Escolhendo um Tamanho de Amostra<\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"margin-bottom: 5px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/\">15. Previs\u00e3o<\/a>\n<ul style=\"margin: 5px 0 5px 15px;padding-left: 10px;list-style-type: none;border-left: 1px solid #ddd\">\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-1\/\">15.1. Correla\u00e7\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-2\/\">15.2. Linha de Regress\u00e3o<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-3\/\">15.3. M\u00e9todo dos M\u00ednimos Quadrados<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-4\/\">15.4. Regress\u00e3o de M\u00ednimos Quadrados<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-5\/\">15.5. Diagn\u00f3sticos Visuais<\/a><\/li>\n<li style=\"margin-bottom: 3px\"><a style=\"padding: 2px 0\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/15-0\/15-6\/\">15.6. Diagn\u00f3stico Num\u00e9rico<\/a><\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/div>\n<p><!-- Main Content --><\/p>\n<div style=\"overflow: hidden\">\n<p><!--###########################################################################################################################################################--><\/p>\n<pre><code><span style=\"color: black\">from datascience import *\r\npath_data = '..\/..\/..\/assets\/data\/'\r\nimport matplotlib\r\nmatplotlib.use('Agg')\r\n%matplotlib inline\r\nimport matplotlib.pyplot as plots\r\nplots.style.use('fivethirtyeight')\r\nimport numpy as np<\/span><\/code><\/pre>\n<h1 id=\"classificando-por-uma-vari\u00e1vel\" style=\"text-align: center\">Classificando por uma Vari\u00e1vel<\/h1>\n<p style=\"text-align: justify\">Os cientistas de dados frequentemente precisam classificar indiv\u00edduos em grupos de acordo com caracter\u00edsticas compartilhadas, e ent\u00e3o identificar algumas caracter\u00edsticas dos grupos. Por exemplo, no exemplo usando os dados de alturas de Galton, vimos que era \u00fatil classificar as fam\u00edlias de acordo com as alturas m\u00e9dias dos pais e depois encontrar a altura m\u00e9dia das crian\u00e7as em cada grupo.<\/p>\n<p style=\"text-align: justify\">Esta se\u00e7\u00e3o trata da classifica\u00e7\u00e3o de indiv\u00edduos em categorias que n\u00e3o s\u00e3o num\u00e9ricas. Come\u00e7amos lembrando o uso b\u00e1sico do <code>group<\/code>.<\/p>\n<h2>Contando o N\u00famero em Cada Categoria<\/h2>\n<p style=\"text-align: justify\">O m\u00e9todo <code>group<\/code> com um \u00fanico argumento conta o n\u00famero de linhas para cada categoria em uma coluna. O resultado cont\u00e9m uma linha para cada valor \u00fanico na coluna agrupada.<\/p>\n<p style=\"text-align: justify\">Aqui est\u00e1 uma pequena tabela de dados sobre cones de sorvete. O m\u00e9todo <code>group<\/code> pode ser usado para listar os sabores distintos e fornecer a contagem de cada sabor.<\/p>\n<pre><code><span style=\"color: black\">cones = Table().with_columns(\r\n    'Flavor', make_array('strawberry', 'chocolate', 'chocolate', 'strawberry', 'chocolate'),\r\n    'Price', make_array(3.55, 4.75, 6.55, 5.25, 5.25)\r\n)\r\ncones<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.75<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<pre><code><span style=\"color: black\">cones.group('Flavor')<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Existem duas categorias distintas, chocolate e morango. A chamada para <code>group<\/code> cria uma tabela de contagens em cada categoria. A coluna \u00e9 chamada <code>count<\/code> por padr\u00e3o e cont\u00e9m o n\u00famero de linhas em cada categoria.<\/p>\n<p>Perceba que tudo isso pode ser resolvido apenas a partir da coluna <code>Flavor<\/code>. A coluna <code>Price<\/code> n\u00e3o foi usada.<\/p>\n<p>Mas e se quis\u00e9ssemos o pre\u00e7o total dos cones de cada sabor diferente? A\u00ed \u00e9 onde entra o segundo argumento do <code>group<\/code>.<\/p>\n<h2 id=\"encontrando-uma-caracter-stica-de-cada-categoria\">Encontrando uma Caracter\u00edstica de Cada Categoria<\/h2>\n<p>O segundo argumento opcional do <code>group<\/code> nomeia a fun\u00e7\u00e3o que ser\u00e1 usada para agregar valores em outras colunas para todas aquelas linhas. Por exemplo, <code>sum<\/code> somar\u00e1 os pre\u00e7os em todas as linhas que correspondem a cada categoria. Esse resultado tamb\u00e9m cont\u00e9m uma linha para cada valor \u00fanico na coluna agrupada, mas tem o mesmo n\u00famero de colunas que a tabela original.<\/p>\n<p>Para encontrar o pre\u00e7o total de cada sabor, chamamos <code>group<\/code> novamente, com <code>Flavor<\/code> como primeiro argumento como antes. Mas desta vez h\u00e1 um segundo argumento: o nome da fun\u00e7\u00e3o <code>sum<\/code>.<\/p>\n<pre><code><span style=\"color: black\">cones.group('Flavor', sum)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price sum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">16.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\">Para criar esta nova tabela, o <code>group<\/code> calculou a soma das entradas de <code>Price<\/code> em todas as linhas correspondentes a cada sabor distinto. Os pre\u00e7os nas tr\u00eas linhas de <code>chocolate<\/code> somam $16.55 (voc\u00ea pode assumir que o pre\u00e7o est\u00e1 sendo medido em d\u00f3lares). Os pre\u00e7os nas duas linhas de <code>strawberry<\/code> totalizam $8.80.<\/p>\n<p style=\"text-align: justify\">O r\u00f3tulo da coluna rec\u00e9m-criada &#8220;sum&#8221; \u00e9 <code>Price sum<\/code>, que \u00e9 criado pegando o r\u00f3tulo da coluna que est\u00e1 sendo somada e acrescentando a palavra <code>sum<\/code>.<\/p>\n<p style=\"text-align: justify\">Como o <code>group<\/code> encontra a <code>sum<\/code> (soma) de todas as colunas que n\u00e3o s\u00e3o a dos categorias, n\u00e3o h\u00e1 necessidade de especificar que ele deve <code>sum<\/code> (somar) os pre\u00e7os.<\/p>\n<p style=\"text-align: justify\">Para entender mais detalhadamente o que o <code>group<\/code> est\u00e1 fazendo, observe que voc\u00ea poderia ter calculado os pre\u00e7os totais voc\u00ea mesmo, n\u00e3o apenas fazendo c\u00e1lculos mentais, mas tamb\u00e9m usando c\u00f3digo. Por exemplo, para encontrar o pre\u00e7o total de todos os cones de chocolate, voc\u00ea poderia come\u00e7ar criando uma nova tabela consistindo apenas dos cones de chocolate e, em seguida, acessar a coluna de pre\u00e7os:<\/p>\n<pre><code><span style=\"color: black\">cones.where('Flavor', are.equal_to('chocolate')).column('Price')<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-spacing: 0;border-collapse: collapse;width: auto;margin-left: 1em\">\n<tbody>\n<tr>\n<td style=\"text-align: right;color: #888;padding-right: 0.5em\">Out[4]:<\/td>\n<td style=\"text-align: left\">array([4.75, 6.55, 5.25])<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<pre><code><span style=\"color: black\">sum(cones.where('Flavor', are.equal_to('chocolate')).column('Price'))<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-spacing: 0;border-collapse: collapse;width: auto;margin-left: 1em\">\n<tbody>\n<tr>\n<td style=\"text-align: right;color: #888;padding-right: 0.5em\">Out[5]:<\/td>\n<td style=\"text-align: left\">16.55<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Isso \u00e9 o que <code>group<\/code> est\u00e1 fazendo para cada valor distinto em <code>Flavor<\/code>.<\/p>\n<pre><code><span style=\"color: black\"># Para cada valor distinto em `Flavor, acesse todas as linhas\r\n# e crie um array de `Price`\r\n\r\ncones_choc = cones.where('Flavor', are.equal_to('chocolate')).column('Price')\r\ncones_strawb = cones.where('Flavor', are.equal_to('strawberry')).column('Price')\r\n\r\n# Exibir os arrays em uma tabela\r\n\r\ngrouped_cones = Table().with_columns(\r\n    'Flavor', make_array('chocolate', 'strawberry'),\r\n    'Array of All the Prices', make_array(cones_choc, cones_strawb)\r\n)\r\n\r\n# Anexe uma coluna com a soma dos valores `Price` em cada array\r\n\r\nprice_totals = grouped_cones.with_column(\r\n    'Sum of the Array', make_array(sum(cones_choc), sum(cones_strawb))\r\n)\r\nprice_totals<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Array of All the Prices<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Sum of the Array<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[4.75 6.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">16.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[3.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Voc\u00ea pode substituir <code>sum<\/code> por qualquer outra fun\u00e7\u00e3o que funcione em arrays. Por exemplo, voc\u00ea pode usar <code>max<\/code> para encontrar o maior pre\u00e7o em cada categoria:<\/p>\n<pre><code><span style=\"color: black\">cones.group('Flavor', max)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Price max<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Mais uma vez, <code>group<\/code> cria matrizes de pre\u00e7os em cada categoria <code>Flavor<\/code>. Mas agora ele encontra o <code>max<\/code> de cada array:<\/p>\n<pre><code><span style=\"color: black\">price_maxes = grouped_cones.with_column(\r\n    'Max of the Array', make_array(max(cones_choc), max(cones_strawb))\r\n)\r\nprice_maxes<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Array of All the Prices<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Max of the Array<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[4.75 6.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.55<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[3.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Na verdade, a chamada original para <code>group<\/code> com apenas um argumento tem o mesmo efeito que usar <code>len<\/code> como fun\u00e7\u00e3o e depois limpar a tabela.<\/p>\n<pre><code><span style=\"color: black\">lengths = grouped_cones.with_column(\r\n    'Length of the Array', make_array(len(cones_choc), len(cones_strawb))\r\n)\r\nlengths<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">Flavor<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Array of All the Prices<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">Length of the Array<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">chocolate<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[4.75 6.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">strawberry<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">[3.55 5.25]<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2 id=\"exemplo-sal-rios-da-nba\">Exemplo: Sal\u00e1rios da NBA<\/h2>\n<p>A tabela <code>nba<\/code> cont\u00e9m dados sobre os jogadores da National Basketball Association de 2015-2016. J\u00e1 examinamos esses dados anteriormente. Lembre-se de que os sal\u00e1rios s\u00e3o medidos em milh\u00f5es de d\u00f3lares.<\/p>\n<pre><code><span style=\"color: black\">nba1 = Table.read_table(path_data + 'nba_salaries.csv')\r\nnba = nba1.relabeled(\"'15-'16 SALARY\", 'SALARY')\r\nnba<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">PLAYER<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">POSITION<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">TEAM<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">SALARY<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Paul Millsap<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">18.6717<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Al Horford<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">12.0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Tiago Splitter<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">9.75625<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Jeff Teague<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">8.0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Kyle Korver<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.74648<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Thabo Sefolosha<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Mike Scott<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.33333<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Kent Bazemore<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">2.0<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Dennis Schroder<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1.7634<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Tim Hardaway Jr.<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">1.30452<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><strong>1.<\/strong> Quanto dinheiro cada equipe pagou pelos sal\u00e1rios de seus jogadores?<\/p>\n<p>As \u00fanicas colunas envolvidas s\u00e3o <code>TEAM<\/code> e <code>SALARY<\/code>. Temos que <code>agrupar<\/code> as linhas por <code>TEAM<\/code> e depois <code>sum<\/code> (somar) os sal\u00e1rios dos grupos.<\/p>\n<pre><code><span style=\"color: black\">teams_and_money = nba.select('TEAM', 'SALARY')\r\nteams_and_money.group('TEAM', sum)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">TEAM<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">SALARY sum<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Atlanta Hawks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">69.5731<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Boston Celtics<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">50.2855<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Brooklyn Nets<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">57.307<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Charlotte Hornets<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">84.1024<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Chicago Bulls<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">78.8209<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Cleveland Cavaliers<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">102.312<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Dallas Mavericks<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">65.7626<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Denver Nuggets<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">62.4294<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Detroit Pistons<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">42.2118<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">Golden State Warriors<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">94.0851<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><strong>2.<\/strong> Quantos jogadores da NBA havia em cada uma das cinco posi\u00e7\u00f5es?<\/p>\n<p>Temos que classificar por <code>POSITION<\/code> e contar. Isso pode ser feito com apenas um argumento para agrupar:<\/p>\n<pre><code><span style=\"color: black\">nba.group('POSITION')<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">POSITION<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">count<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">69<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">85<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">85<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">82<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">96<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><strong>3.<\/strong> Qual foi o sal\u00e1rio m\u00e9dio dos jogadores em cada uma das cinco posi\u00e7\u00f5es?<\/p>\n<p>Desta vez temos que agrupar por <code>POSITION<\/code> e tirar a m\u00e9dia dos sal\u00e1rios. Para maior clareza trabalharemos com uma tabela apenas dos cargos e dos sal\u00e1rios.<\/p>\n<pre><code><span style=\"color: black\">positions_and_money = nba.select('POSITION', 'SALARY')\r\npositions_and_money.group('POSITION', np.mean)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">POSITION<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">SALARY mean<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.08291<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.95134<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.16549<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.53267<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.9882<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>O Center foi a posi\u00e7\u00e3o mais bem paga, com uma m\u00e9dia de mais de 6 milh\u00f5es de d\u00f3lares.<\/p>\n<p>Se n\u00e3o tiv\u00e9ssemos selecionado as duas colunas como nosso primeiro passo, <code>group<\/code> n\u00e3o tentaria &#8220;averiguar&#8221; as colunas categ\u00f3ricas em <code>nba<\/code>. (\u00c9 imposs\u00edvel calcular a m\u00e9dia de duas strings como &#8220;Atlanta Hawks&#8221; e &#8220;Boston Celtics&#8221;.) Ele realiza opera\u00e7\u00f5es aritm\u00e9ticas apenas em colunas num\u00e9ricas e deixa o resto em branco.<\/p>\n<pre><code><span style=\"color: black\">nba.group('POSITION', np.mean)<\/span><\/code><\/pre>\n<table style=\"font-family: monospace;border-collapse: collapse;width: auto;margin-left: 1em\" border=\"1\">\n<thead>\n<tr style=\"background-color: #f0f0f0;border-bottom: 2px solid #ddd\">\n<th style=\"text-align: left;padding: 4px 8px\">POSITION<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">PLAYER mean<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">TEAM mean<\/th>\n<th style=\"text-align: left;padding: 4px 8px\">SALARY mean<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">C<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">6.08291<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">4.95134<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">PG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.16549<\/td>\n<\/tr>\n<tr style=\"background-color: #f8f8f8\">\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SF<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">5.53267<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">SG<\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\"><\/td>\n<td style=\"padding: 4px 8px;border: 1px solid #ddd\">3.9882<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><!--###########################################################################################################################################################--><\/p>\n<table width=\"100%\">\n<tbody>\n<tr>\n<td align=\"left\"><a class=\"next-page-link\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-1\/\">\u2190 Cap\u00edtulo 8.1 &#8211; Aplicando uma Fun\u00e7\u00e3o a uma Coluna<\/a><\/td>\n<td align=\"right\"><a class=\"next-page-link\" href=\"https:\/\/literaciadigital.ufms.br\/data8\/8-0\/8-3\/\">Cap\u00edtulo 8.3 &#8211; Classifica\u00e7\u00e3o Cruzada \u2192<\/a><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><!--###########################################################################################################################################################--><\/p>\n<\/div>\n<\/div>\n<div style=\"clear: both;height: 1px;margin-top: -1px\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>\u00cdndice 1. O que \u00e9 Ci\u00eancia de Dados? 1.1. Introdu\u00e7\u00e3o 1.1.1. Ferramentas Computacionais 1.1.2. T\u00e9cnicas Estat\u00edsticas 1.2. Por que Ci\u00eancia de Dados? 1.3. Tra\u00e7ando os Cl\u00e1ssicos 1.3.1. Personagens Liter\u00e1rios 1.3.2. Outro Tipo de Personagem 2. Causalidade e Experimentos 2.1. John Snow e a Bomba da Broad Street 2.2. O &#8220;Grande Experimento&#8221; de Snow 2.3. Estabelecendo [&hellip;]<\/p>\n","protected":false},"author":21894,"featured_media":0,"parent":484,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width.php","meta":{"footnotes":""},"coauthors":[14],"class_list":["post-542","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/542","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/users\/21894"}],"replies":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/comments?post=542"}],"version-history":[{"count":6,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/542\/revisions"}],"predecessor-version":[{"id":974,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/542\/revisions\/974"}],"up":[{"embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/pages\/484"}],"wp:attachment":[{"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/media?parent=542"}],"wp:term":[{"taxonomy":"author","embeddable":true,"href":"https:\/\/literaciadigital.ufms.br\/en\/wp-json\/wp\/v2\/coauthors?post=542"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}