<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.0" specific-use="sps-1.8" xml:lang="en"
	xmlns:mml="http://www.w3.org/1998/Math/MathML"
	xmlns:xlink="http://www.w3.org/1999/xlink">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">rsocp</journal-id>
			<journal-title-group>
				<journal-title>Revista de Sociologia e Pol&#xED;tica</journal-title>
				<abbrev-journal-title abbrev-type="publisher">Rev. Sociol. Polit.</abbrev-journal-title>
			</journal-title-group>
			<issn pub-type="ppub">0104-4478</issn>
			<issn pub-type="epub">1678-9873</issn>
			<publisher>
				<publisher-name>Universidade Federal do Paran&#xE1;</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">00006</article-id>
			<article-id pub-id-type="doi">10.1590/1678-987320287406en</article-id>
			<article-id pub-id-type="other">00207</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Original Articles</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Read this paper if you want to learn logistic regression</article-title>
				<trans-title-group xml:lang="en">
					<trans-title>Leia este artigo se voc&#x00EA; quiser aprender regress&#x00E3;o log&#x00ED;stica</trans-title>
				</trans-title-group>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-0249-517X</contrib-id>
					<name>
						<surname>Fernandes</surname>
						<given-names>Ant&#x00F4;nio Alves T&#x00F4;rres</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
					<xref ref-type="corresp" rid="c1"/>
					<email>antonio.alvestorres@ufpe.br</email>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0001-6982-2262</contrib-id>
					<name>
						<surname>Figueiredo</surname>
						<given-names>Dalson Britto</given-names>
						<suffix>Filho</suffix>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
					<xref ref-type="corresp" rid="c2"/>
					<email>dalson.figueiredofo@ufpe.br</email>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0003-1717-523X</contrib-id>
					<name>
						<surname>Rocha</surname>
						<given-names>Enivaldo Carvalho da</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
					<xref ref-type="corresp" rid="c3"/>
					<email>enivaldocrocha@gmail.com</email>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-2257-8108</contrib-id>
					<name>
						<surname>Nascimento</surname>
						<given-names>Willber da Silva</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
					<xref ref-type="corresp" rid="c4"/>
					<email>nascimentowillber@gmail.com</email>
				</contrib>
				<aff id="aff1">
					<sup>I</sup>
					<institution content-type="normalized">Federal University of Pernambuco</institution>
					<institution content-type="orgname">Federal University of Pernambuco</institution>
					<institution content-type="orgdiv1">Postgraduate Program in Political Science</institution>
					<addr-line>
						<named-content content-type="city">Recife</named-content>
						<named-content content-type="state">PE</named-content>
					</addr-line>
					<country country="BR">Brazil</country>
					<institution content-type="original">Postgraduate Program in Political Science, Federal University of Pernambuco, Recife, PE, Brazil.</institution>
				</aff>
			</contrib-group>
			<author-notes>
				<corresp id="c1">Ant&#x00F4;nio Alves T&#x00F4;rres Fernandes (antonio.alvestorres@ufpe.br) is a master&#x2019;s student in Political Science in the PPGCP/UFPE, an undergraduate student in Economics (UCB/DF), and a member of the Research Methods in Political Science Group (DCP/UFPE).</corresp>
				<corresp id="c2">Dalson Britto Figueiredo Filho (dalson.figueiredofo@ufpe.br) is professor at Political Science Graduate Program at Federal University of Pernambuco and author of the book &#x201C;Quantitative methods in Political Science&#x201D;, Editora InterSaberes.</corresp>
				<corresp id="c3">Enivaldo Carvalho da Rocha (enivaldocrocha@gmail.com) is a retired full professor at Political Science Graduate Program at Federal University of Pernambuco.</corresp>
				<corresp id="c4">Willber da Silva Nascimento (nascimentowillber@gmail.com) holds a PhD in Political Science from UFPE and works as postdoctoral researcher at PPGCP/UFPE/FACEPE.</corresp>
			</author-notes>
			<!--pub-date date-type="pub" publication-format="electronic">
				<day>19</day>
				<month>12</month>
				<year>2020</year>
			</pub-date>
			<pub-date date-type="collection" publication-format="electronic"-->
				<pub-date pub-type="epub-ppub">
				<season>Oct-Dec</season>
				<year>2020</year>
			</pub-date>
			<volume>28</volume>
			<issue>74</issue>
			<elocation-id>006a</elocation-id>
			<history>
				<date date-type="received">
					<day>19</day>
					<month>10</month>
					<year>2019</year>
				</date>
				<date date-type="accepted">
					<day>07</day>
					<month>05</month>
					<year>2020</year>
				</date>
				<date date-type="received">
					<day>16</day>
					<month>05</month>
					<year>2020</year>
				</date>
			</history>
			<permissions>
				<copyright-year>2020</copyright-year>
				<license xml:lang="en" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by-nc/4.0">
					<license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License which permits unrestricted non-commercial use, distribution, and reproduction in any medium provided the original work is properly cited.</license-p>
				</license>
			</permissions>
			<abstract>
				<title>ABSTRACT</title>
				<sec>
					<title>Introduction:</title>
					<p>What if my response variable is binary categorical? This paper provides an intuitive introduction to logistic regression, the most appropriate statistical technique to deal with dichotomous dependent variables.</p>
				</sec>
				<sec>
					<title>Materials and Methods:</title>
					<p>we estimate the effect of corruption scandals on the chance of reelection of candidates running for the Brazilian Chamber of Deputies using data from 
						<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref>. Specifically, we show the computational implementation in R and we explain the substantive interpretation of the results.
					</p>
				</sec>
				<sec>
					<title>Results:</title>
					<p>we share replication materials which quickly enables students and professionals to use the procedures presented here for their studying and research activities.</p>
				</sec>
				<sec>
					<title>Discussion:</title>
					<p>we hope to facilitate the use of logistic regression and to spread replication as a data analysis teaching tool.</p>
				</sec>
			</abstract>
			<trans-abstract xml:lang="en">
				<title>RESUMO</title>
				<sec>
					<title>Introdu&#x00E7;&#x00E3;o:</title>
					<p>E se a minha vari&#x00E1;vel resposta for categ&#x00F3;rica bin&#x00E1;ria? Este artigo apresenta uma introdu&#x00E7;&#x00E3;o intuitiva &#x00E0; regress&#x00E3;o log&#x00ED;stica, t&#x00E9;cnica estat&#x00ED;stica mais adequada para lidar com vari&#x00E1;veis dependentes dicot&#x00F4;micas.</p>
				</sec>
				<sec>
					<title>Materiais e M&#x00E9;todos:</title>
					<p>estimamos o efeito dos esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o sobre a chance de reelei&#x00E7;&#x00E3;o de candidatos concorrentes a deputado federal no Brasil a partir dos dados de Castro e Nunes (2014). Em particular, mostramos a implementa&#x00E7;&#x00E3;o computacional no R e explicamos a interpreta&#x00E7;&#x00E3;o substantiva dos resultados.</p>
				</sec>
				<sec>
					<title>Resultados:</title>
					<p>disponibilizamos todos os materiais de replica&#x00E7;&#x00E3;o, permitindo que estudantes e profissionais utilizem os procedimentos discutidos aqui em suas atividades de estudo e pesquisa.</p>
				</sec>
				<sec>
					<title>Discuss&#x00E3;o:</title>
					<p>esperamos incentivar o uso da regress&#x00E3;o log&#x00ED;stica e difundir a replicabilidade como ferramenta de ensino de an&#x00E1;lise de dados.</p>
				</sec>
			</trans-abstract>
			<kwd-group xml:lang="en">
				<title>KEYWORDS:</title>
				<kwd>regression</kwd>
				<kwd>logistic regression</kwd>
				<kwd>replication</kwd>
				<kwd>quantitative methods</kwd>
				<kwd>transparency</kwd>
			</kwd-group>
			<kwd-group xml:lang="pt">
				<title>PALAVRAS-CHAVE:</title>
				<kwd>regress&#x00E3;o</kwd>
				<kwd>regress&#x00E3;o log&#x00ED;stica</kwd>
				<kwd>replica&#x00E7;&#x00E3;o</kwd>
				<kwd>m&#x00E9;todos quantitativos</kwd>
				<kwd>transpar&#x00EA;ncia</kwd>
			</kwd-group>
			<counts>
				<fig-count count="5"/>
				<table-count count="12"/>
				<equation-count count="2"/>
				<ref-count count="70"/>
			</counts>
		</article-meta>
	</front>
	<body>
		<sec sec-type="intro">
			<title>I. Introduction
				<xref ref-type="fn" rid="fn1">
					<sup>1</sup>
				</xref>
			</title>
			<p>The least squares linear model (OSL) is one of the most used tools in Political Science (
				<xref ref-type="bibr" rid="B41">Kruger &amp; Lewis-Beck, 2008)</xref>. As long as its assumptions are respected, the estimated coefficients from a random sample give the best linear unbiased estimator of the population&#x2019;s parameters (
				<xref ref-type="bibr" rid="B35">Kennedy, 2005</xref>). Unbiased because it does not systematically over or underestimates the parameter&#x2019;s value and because it gives the smallest variance among all possible estimates (
				<xref ref-type="bibr" rid="B43">Lewis-Beck, 1980</xref>).
			</p>
			<p>What about when assumptions are violated? In that case, we must adopt techniques better suited to the nature of the data. For instance, imagine a study that investigates the impact of campaign spending on the chance of a candidate being elected or not. Since the dependent variable is binary, some assumptions of the least squares model are violated (homoscedasticity, linearity, and normality) and the estimates may be inconsistent. A logistic regression is the best tool to handle dichotomous dependent variables, that is, when 
				<italic>y</italic> can only take on two categories: elected or not-elected; adopted the policy or did not adopt the policy; voted for president Bolsonaro or not. 
				<xref ref-type="bibr" rid="B45">Lottes, DeMaris, and Adler (1996)</xref> argue that, despite logistic regression&#x2019;s popularity in the Social Sciences, there is still a lot of confusion regarding its correct use. Given our pedagogical experience, this difficulty is explained by the lack of intuitive teaching materials. Moreover, many undergraduate and graduate programs, as well as textbooks, end their content at linear regression, shortening the dissemination of other data analysis techniques.
			</p>
			<p>To fill this gap, this paper presents an introduction to logistic regression. Our goal is to facilitate the understanding of its practical application. As far as audience, we write to students in the early stages of training and teachers who need materials for quantitative methods courses. Methodologically, we reproduce data from 
				<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref> regarding the relationship between involvement in corruption scandals (
				<italic>Mensal&#x00E3;o</italic>
				<xref ref-type="fn" rid="fn2">
					<sup>2</sup>
				</xref> and 
				<italic>Sanguessugas</italic>
				<xref ref-type="fn" rid="fn3">
					<sup>3</sup>
				</xref> scandals) and the reelection chances for candidates running for federal deputy in Brazil in 2006. All data and scripts are available at 
				<italic>Open Science Framework</italic> (OSF)
				<xref ref-type="fn" rid="fn4">
					<sup>4</sup>
				</xref> website.
			</p>
			<p>By the end, the reader should be able to identify when a logistic regression should be used, computationally implement the model, and interpret the results. We are aware that this paper does not replace a detailed reading of primary sources on the subject and more technical materials. Nevertheless, we hope to make understanding logistic regression easier to you and to disseminate replicability as data analysis teaching tool.</p>
			<p>The remainder of the paper is divided as follows: the next section explains the underlying features logistic regression. The third identifies the main technical conditions that must be met to ensure that the model&#x2019;s estimates are consistent. The fourth section describes the main statistics that must be observed. Lastly, we provide some recommendations on how to improve the quality of methodological training offered to Political Science undergraduate and graduate students in Brazil.</p>
		</sec>
		<sec>
			<title>II. The logic of logistic regression
				<xref ref-type="fn" rid="fn5">
					<sup>5</sup>
				</xref>
			</title>
			<p>The use of binary categorical dependent variables is common in Political Science empirical research. For example: voted or not (Nicolau, 2007; 
				<xref ref-type="bibr" rid="B63">Soares, 2000</xref>), won or lost the electoral contest (
				<xref ref-type="bibr" rid="B64">Speck &amp; Mancuso, 2013</xref>; 
				<xref ref-type="bibr" rid="B57">Peixoto, 2009</xref>), adhered to the policy or not (
				<xref ref-type="bibr" rid="B21">Furlong, 1998</xref>), democracy or not-democracy (
				<xref ref-type="bibr" rid="B24">Goldsmith, Chalup &amp; Quinlan, 2008</xref>), started a war or not (
				<xref ref-type="bibr" rid="B28">Henderson &amp; Singer, 2000</xref>), appealed a judicial ruling or not (
				<xref ref-type="bibr" rid="B11">Epstein, Landes &amp; Posner, 2013</xref>). For all these situations, a logistic regression is the best suited technique to model the dependent variable&#x2019;s variation given a set of independent variables.
			</p>
			<p>In a logistic regression, the dependent variable only has two categories
				<xref ref-type="fn" rid="fn6">
					<sup>6</sup>
				</xref>. Generally, the occurrence of the event is coded as 1 and its absence as 0. Keeping in mind that codification changes the coefficients&#x2019; signal and, therefore, their substantive interpretation. To better understand how a logistic regression works, it is necessary to understand the logic of regression analysis as a whole. Let&#x2019;s look at the linear model&#x2019;s classic notation:
			</p>
			<disp-formula id="eq1">
				<label>(1)</label>
				<mml:math id="m1" display="block">
					<mml:mrow>
						<mml:mtext>Y</mml:mtext>
						<mml:mo>=</mml:mo>
						<mml:mtext>&#x03B1;</mml:mtext>
						<mml:mo>+</mml:mo>
						<mml:mtext>&#x03B2;X</mml:mtext>
						<mml:mo>+</mml:mo>
						<mml:mtext>&#x03B5;</mml:mtext>
					</mml:mrow>
				</mml:math>
			</disp-formula>
			<p>Y represents the dependent variable, that is, what we are trying to understand/explain/predict. X represents the independent variable. The intercept, (&#x03B1;), represents the value of Y when X equals zero. The regression coefficient, (&#x03B2;), represents the variation observed in Y associated with the increase of one unit of X. The stochastic term, (&#x03B5;), represents the error of the model. Technically, it is possible to estimate if there is a linear relationship between a dependent variable (Y) and different independent variables. Moreover, the model allows the observation of the effect magnitude and to test the coefficients&#x2019; statistical significance (p-value and confidence intervals).</p>
			<p>A logistic regression can be interpreted as a particular case of generalized linear models (GLM)
				<xref ref-type="fn" rid="fn7">
					<sup>7</sup>
				</xref>, in which the dependent variable is dichotomous. Figure&nbsp;1 compares the linear and logistic models.
			</p>
			<fig id="f1">
				<label>Figure 1</label>
				<caption>
					<title>Linear regression line versus logistic curve</title>
				</caption>
				<graphic xlink:href="figura1.jpg"/>
			</fig>
			<p>Source: The authors, based on Hair, A. 
				<italic>et al.</italic> (2019).
			</p>
			<p>Because the dependent variable in the logistic model takes on only two values (0 or 1), the probability predicted by the model must also be limited to that interval. When X (independent variable) takes on lower values, the probability approaches zero. Conversely, as X increases, the probability approaches 1. For 
				<xref ref-type="bibr" rid="B40">Kleibaum and Klein (2010)</xref>, that logistic functions vary between 0 and 1 explains the model&#x2019;s popularity. Given that the dependent variable&#x2019;s binary nature violates some the linear model&#x2019;s assumptions (homoscedasticity
				<xref ref-type="fn" rid="fn8">
					<sup>8</sup>
				</xref>, linearity
				<xref ref-type="fn" rid="fn9">
					<sup>9</sup>
				</xref>, normality), using a linear model to analyze binary variables may generate inefficient and biased coefficients
				<xref ref-type="fn" rid="fn10">
					<sup>10</sup>
				</xref>. To better understand the relationship between linear and logistic models, we reproduced the data from 
				<xref ref-type="bibr" rid="B30">Hosmer, Lemeshow, and Sturdivant (2013)</xref> on the association between age and coronary disease (Graph 1)
				<xref ref-type="fn" rid="fn11">
					<sup>11</sup>
				</xref>.
			</p>
			<fig id="f2">
				<label>Graph 1</label>
				<caption>
					<title>Age x coronary disease</title>
				</caption>
				<graphic xlink:href="figura2.jpg"/>
			</fig>
			<p>Source: The authors based on and 
				<xref ref-type="bibr" rid="B30">Hosmer, Lemeshow, and Sturdivant (2013)</xref>.
			</p>
			<p>The vertical dashed line represents the age mean: 44,38 years old. The cases were coded as 1 (developed coronary disease) and 0 (did not develop it). The trend is very clear: as age increases, the amount of people diagnosed with coronary disease grows. An intuitive way to observe this pattern is to examine the number of cases using the mean as a parameter for comparison. For example, for people above the mean there more illness cases, while for people below the mean, the larger concentration is in the &#x201C;did not develop it&#x201D; category. That is, the graph is stating that there is an association between age and coronary disease. It is in that sense that a logistic regression informs the probability of the event coded as 1 occurring, in the case at hand, developing coronary disease. 
				<xref ref-type="table" rid="t1">Table 1</xref> presents the data by age group.
			</p>
			<table-wrap id="t1">
				<label>Table 1</label>
				<caption>
					<title>Age group x coronary disease</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="20%">
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top"></th>
							<th align="center" valign="top"></th>
							<th align="center" valign="top" colspan="2" style="border-bottom: thin solid;">Disease</th>
							<th align="center" valign="top"></th>
						</tr>
						<tr>
							<th align="left" valign="top">Age Group</th>
							<th align="center" valign="top">N</th>
							<th align="center" valign="top">Yes</th>
							<th align="center" valign="top">No</th>
							<th align="center" valign="top">Yes (%)</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">20-29</td>
							<td align="center" valign="top">10</td>
							<td align="center" valign="top">1</td>
							<td align="center" valign="top">9</td>
							<td align="center" valign="top">0.1</td>
						</tr>
						<tr>
							<td align="left" valign="top">30-34</td>
							<td align="center" valign="top">15</td>
							<td align="center" valign="top">2</td>
							<td align="center" valign="top">13</td>
							<td align="center" valign="top">0.13</td>
						</tr>
						<tr>
							<td align="left" valign="top">35-39</td>
							<td align="center" valign="top">12</td>
							<td align="center" valign="top">3</td>
							<td align="center" valign="top">9</td>
							<td align="center" valign="top">0.25</td>
						</tr>
						<tr>
							<td align="left" valign="top">40-44</td>
							<td align="center" valign="top">15</td>
							<td align="center" valign="top">5</td>
							<td align="center" valign="top">10</td>
							<td align="center" valign="top">0.33</td>
						</tr>
						<tr>
							<td align="left" valign="top">45-49</td>
							<td align="center" valign="top">13</td>
							<td align="center" valign="top">6</td>
							<td align="center" valign="top">7</td>
							<td align="center" valign="top">0.46</td>
						</tr>
						<tr>
							<td align="left" valign="top">50-54</td>
							<td align="center" valign="top">8</td>
							<td align="center" valign="top">5</td>
							<td align="center" valign="top">3</td>
							<td align="center" valign="top">0.63</td>
						</tr>
						<tr>
							<td align="left" valign="top">55-59</td>
							<td align="center" valign="top">17</td>
							<td align="center" valign="top">13</td>
							<td align="center" valign="top">4</td>
							<td align="center" valign="top">0.76</td>
						</tr>
						<tr>
							<td align="left" valign="top">60-69</td>
							<td align="center" valign="top">10</td>
							<td align="center" valign="top">8</td>
							<td align="center" valign="top">2</td>
							<td align="center" valign="top">0.8</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Total</bold>
							</td>
							<td align="center" valign="top">
								<bold>100</bold>
							</td>
							<td align="center" valign="top">
								<bold>43</bold>
							</td>
							<td align="center" valign="top">
								<bold>57</bold>
							</td>
							<td align="center" valign="top"></td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors, based on 
						<xref ref-type="bibr" rid="B30">Hosmer, Lemeshow, and Sturdivant (2013)</xref>
					</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>Simply observe the last column to reach the same conclusion presented by Graph 1: the higher the age, the higher the chance to develop coronary diseases. An additional option to visualize the relationship between these variables is to graphically represent the percentage of people who are ill for each age group (Graph 2).</p>
			<fig id="f3">
				<label>Graph 2</label>
				<caption>
					<title>Age group x coronary disease</title>
				</caption>
				<graphic xlink:href="figura3.jpg"/>
			</fig>
			<p>Source: The authors based on 
				<xref ref-type="bibr" rid="B30">Hosmer, Lemeshow, and Sturdivant (2013)</xref>.
			</p>
			<p>We observe a positive correlation between age (axis X) and the probability to develop cardiac diseases (axis Y) is observed. A logistic regression will inform the direction, magnitude, and the statistical significance level of this relationship. In a nutshell, the researcher must use a logistic regression when the dependent variable is categorical and binary. Given that many variables in the Humanities are categorical, the analytical benefits associated with the correct application and interpretation of a logistic regression are evident
				<xref ref-type="fn" rid="fn12">
					<sup>12</sup>
				</xref>.
			</p>
		</sec>
		<sec>
			<title>III. Planning a logistic regression</title>
			<p>
				<xref ref-type="table" rid="t2">Table 2</xref> describes the five stages that should be observed.
			</p>
			<table-wrap id="t2">
				<label>Table 2</label>
				<caption>
					<title>Planning a logistic regression in five stages</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="50%">
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">Stage</th>
							<th align="left" valign="top">Description</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">1
								<sup>st</sup>
							</td>
							<td align="left" valign="top">Identify the dependent variable</td>
						</tr>
						<tr>
							<td align="left" valign="top">2
								<sup>nd</sup>
							</td>
							<td align="left" valign="top">Note the technical requirements</td>
						</tr>
						<tr>
							<td align="left" valign="top">3
								<sup>rd</sup>
							</td>
							<td align="left" valign="top">Estimate and fit the model</td>
						</tr>
						<tr>
							<td align="left" valign="top">4
								<sup>th</sup>
							</td>
							<td align="left" valign="top">Interpret the results</td>
						</tr>
						<tr>
							<td align="left" valign="top">5
								<sup>th</sup>
							</td>
							<td align="left" valign="top">Validate the results</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: the authors, based on 
						<xref ref-type="bibr" rid="B27">Hair 
							<italic>et al.</italic> (2009)
						</xref>.
					</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>The first stage is to identify a research question for which the dependent variable is naturally dichotomous. For example, given the popularity of logistic regression in health research, commonly used variables are: lived/died; sick/not sick; smoker/ non-smoker. Usually, a researcher must forgo from recoding a continuous or discrete variable into a dichotomous categorical one. More clearly, let&#x2019;s say the interest variable is income per capita. It is wrong to recode income to produce two categories: rich versus poor. Technically, recoding a quantitative variable into a categorical one implies loss of information and that reduces the estimates&#x2019; consistency (
				<xref ref-type="bibr" rid="B12">Fernandes 
					<italic>et al.</italic>, 2019
				</xref>)
				<xref ref-type="fn" rid="fn13">
					<sup>13</sup>
				</xref>.
			</p>
			<p>At the second stage, the technical requirements must be observed. Despite being more flexible than other statistical techniques, logistic regression is sensitive to, for example, problems of multicollinearity (high levels of correlation between independent variables)
				<xref ref-type="fn" rid="fn14">
					<sup>14</sup>
				</xref>. There are different procedures to minimize this problem. The simplest is to increase the number of observations (
				<xref ref-type="bibr" rid="B35">Kennedy, 2005</xref>). An additional option is to use some data reduction technique to create a synthetic measure from the variance of the original variables. We must not simply exclude one of the independent variables, under the risk of producing errors in the model specification. In a logistic regression, the size of the sample is key (Hair 
				<italic>et al.</italic>, 2009). Small samples tend to produce inconsistent estimates. On the other hand, excessively large samples increase the power of statistical tests in such a way that any effect tends to be statistically significant, regardless of magnitude. 
				<xref ref-type="bibr" rid="B31">Hosmer and Lemeshow (2000)</xref> suggest a minimal 
				<italic>n</italic> of 400 cases. 
				<xref ref-type="bibr" rid="B27">Hair 
					<italic>et al.</italic> (2009)
				</xref> suggest a ratio of 10 cases for each independent variable included in the model. 
				<xref ref-type="bibr" rid="B56">Pedhazur (1982)</xref> recommends a ratio of 30 cases for each estimated parameter.
			</p>
			<p>Another eventual source for problems is outliers. Extreme cases produce disastrous results in data analysis and in the case of a logistic regression, the presence of atypical observations may harm the model&#x2019;s fit. Once aberrant cases are detected, a researcher must decide what to do with them. Sometimes an extreme case is nothing more than a typo and can be easily solved. One option is to exclude outliers from the model&#x2019;s estimation and measure the impact of its inclusion on the coefficients. Another procedure commonly adopted is to recode the case, giving it a less extreme value, the mean for example. In any case, it is important to describe in detail what was done to deal with eventual extreme observations
				<xref ref-type="fn" rid="fn15">
					<sup>15</sup>
				</xref>.
			</p>
			<p>At stage three, the researcher must estimate the model. Here, two procedures are essential: a) report the software and b) and share replication materials, which include the original data, the manipulated data, and the computational scripts
				<xref ref-type="fn" rid="fn16">
					<sup>16</sup>
				</xref>. These procedures increase transparency and make replicability of results easier (
				<xref ref-type="bibr" rid="B38">King, 1995</xref>; 
				<xref ref-type="bibr" rid="B54">Paranhos 
					<italic>et al.</italic>, 2013
				</xref>; 
				<xref ref-type="bibr" rid="B33">Janz, 2016</xref>; 
				<xref ref-type="bibr" rid="B12">Figueiredo Filho 
					<italic>et al.</italic>, 2019
				</xref>). After estimating the model, the next step is evaluating the goodness of the fit. This can be done by comparing the null model (just the intercept) with the model that incorporates the independent variables. A statistically significant difference between the models indicates that the explanatory variables help to predict the occurrence of the dependent variable. 
				<xref ref-type="fig" rid="f2">Figure 2</xref> shows the underlying logic of model comparison when we are using logistic regression.
			</p>
			<fig id="f4">
				<label>Figure 2</label>
				<caption>
					<title>Comparing the fit of logistic models</title>
				</caption>
				<graphic xlink:href="figura4.jpg"/>
			</fig>
			<p>Source: 
				<xref ref-type="bibr" rid="B27">Hair 
					<italic>et al.</italic> (2009)
				</xref>.
			</p>
			<p>Comparatively, model B has a better fit than model A. This can be observed given the difference in discriminatory power. While model A presents high variability, model B is more precise. For Tabachnick, Fidell, and Ullman,</p>
			<disp-quote>
				<p>[...] &#x201C;logistic regression, like multiway frequency analysis, can be used to fit and compare models. The simplest (and worst-fitting) model includes only the constant and none of the predictors. The most complex (and &#x2018;best&#x2019;-fitting) model includes the constant, all predictors, and, perhaps, interactions among predictors. Often, however, not all predictors (and interactions) are related to the outcome. The researcher uses goodness-of-fit tests to choose the model that does the best job of prediction with the fewest predictors.&#x201D; (
					<xref ref-type="bibr" rid="B66">Tabachnick, Fidell &amp; Ullman, 2007</xref>, p. 439).
				</p>
			</disp-quote>
			<p>The fourth stage is the interpretation of results. Unfortunately, many works limit themselves to analyzing the statistical significance of the estimates and do not pay attention to the coefficients&#x2019; magnitude. We suggest that researchers interpret the coefficients and substantively discuss how results are related to the research hypothesis. Unlike a linear regression, in which coefficients are easy to interpret, the estimates produced in the logistic model are less intuitive
				<xref ref-type="fn" rid="fn17">
					<sup>17</sup>
				</xref>. This is because the logit transformation informs the independent variable&#x2019;s effect on the variation of the dependent variable&#x2019;s natural logarithm of the odds. For example, when considering a coefficient of 0.6, an increase of 0.6 units is expected in the logit of Y every time X increases by one unit. This approach&#x2019;s main disadvantage is its lack of intelligibility. To state that the amount in logit in
				<xref ref-type="fn" rid="fn18">
					<sup>18</sup>
				</xref> creased 0.6 units is not very intuitive and does not help to understand the relationship between the variables.
			</p>
			<p>A second possibility is to analyze the independent variables&#x2019; impact on the odds of Y. To do so, a researcher must get the exponent of the coefficient itself. In our example, the exponential of 0.6 is 1.82. This means that for each additional unit in X, an increase of 1.82 is expected in the chance of Y occurring, keeping other variables constant. Graph 3 illustrates the distribution of a simulation&#x2019;s exponential function, in which x varies between -5 and 5.</p>
			<fig id="f5">
				<label>Graph 3</label>
				<caption>
					<title>Exponential function</title>
				</caption>
				<graphic xlink:href="figura5.jpg"/>
			</fig>
			<p>Source: The authors, based on 
				<xref ref-type="bibr" rid="B30">Hosmer, Lemeshow, and Sturdivant (2013)</xref>.
			</p>
			<p>In a logistic regression, the exponential of a positive value (+) produces a coefficient larger than 1. Conversely, a negative coefficient (-) returns a Exp (&#x03B2;) smaller than 1. A coefficient with a value of zero produces an Exp (&#x03B2;) equal to 1, indicating that the independent variable does not affect the chance of the dependent variable&#x2019;s occurrence. So, write it down in your notebook: the farther the coefficient is from one, regardless of the direction, the greater the impact of a given independent variable on the chance of the event of interest occurring
				<xref ref-type="fn" rid="fn19">
					<sup>19</sup>
				</xref>.
			</p>
			<p>The third possibility is to estimate the percentage increase in the chance of the occurrence of Y. To do so, one must subtract one unit from the exponentiated regression coefficient and multiply the result by 100, in this case (1.82-1 * 100). Then we have that the increase in one unit of X is associated with an increase of 82% in the chance of Y occurring (
				<italic>ceteris paribus)</italic>. The interpretation of the logistic regression&#x2019;s coefficients may become a little more complicated when the chance is smaller than 1, that is, when the coefficient (&#x03B2;) is negative. One solution is to invert the coefficient (1/coefficient&#x2019;s value), which makes the interpretation easier. For example, a coefficient of 0.639, when inverted, indicates that when the independent variable decreases by in one unit, an average increase of 1.56 is expected in the chance of the dependent variable occurring.
			</p>
			<p>Lastly, the researcher must validate the results observed with a subsample of its original dataset. This procedure gives the research results more reliability, especially when working with small samples. According to 
				<xref ref-type="bibr" rid="B27">Hair 
					<italic>et al.</italic> (2009)
				</xref>,
			</p>
			<disp-quote>
				<p>&#x201C;the most common approach for establishing external validity is the assessment of hit ratios through either a separate sample (holdout sample) or utilizing a procedure that repeatedly processes the estimation sample. External validity is supported when the hit ratio of the selected approach exceeds the comparison standards that represent the predictive accuracy expected by chance.&#x201D; (Hair 
					<italic>et al.</italic>, 2014, p. 329).
				</p>
			</disp-quote>
			<p>Unfortunately, this procedure is rarely used by political scientists. We suspect that the reduced use of validation is in part explained by the lack of training on the specificities of logistic regression. The next section presents an applied example of logistic regression and explains how the results should be interpreted.</p>
		</sec>
		<sec>
			<title>IV. An applied example</title>
			<p>To illustrate the application of the logistic regression, we replicated the data from 
				<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref> on corruption and reelection
				<xref ref-type="fn" rid="fn20">
					<sup>20</sup>
				</xref>. However, since our focus is purely methodological, we will not explore the substantive meaning of the conclusions reported by the authors. According to the planning from the previous section, the first step is to identify the dependent variable that will take value &#x201C;1&#x201D; for candidates reelected in 2006 and &#x201C;0&#x201D; if otherwise
				<xref ref-type="fn" rid="fn21">
					<sup>21</sup>
				</xref>.
			</p>
			<p>The second step is to verify the technical requirements to estimate the logistic regression. During this step, it is important to observe the presence of outliers, the occurrence of high correlation between independent variables, and an adequate sample size. Due to space limitations, we will reproduce only one of the models presented by 
				<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref>. Specifically, the sample used to estimate model 5 from 
				<xref ref-type="table" rid="t6">Table 6</xref> (p. 41), which has a total of 217 observations and a proportion of 19 cases for each independent variable. We do not find deviant cases and the level of correlation between the variables included in the model is acceptable. Thus, we can move on to the next phase.
			</p>
			<p>The third stage consists of the model&#x2019;s estimation:
				<xref ref-type="fn" rid="fn22">
					<sup>22</sup>
				</xref>
			</p>
			<disp-formula id="eq2">
				<label>(2)</label>
				<mml:math id="m2" display="block">
					<mml:mrow>
						<mml:mtext>logit</mml:mtext>
						<mml:mo stretchy="false">(</mml:mo>
						<mml:mtext>Y</mml:mtext>
						<mml:mo stretchy="false">)</mml:mo>
						<mml:mo>=</mml:mo>
						<mml:mtext>&#x03B1;</mml:mtext>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>1</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>1</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>2</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>2</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>3</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>3</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>4</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>4</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>5</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>5</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>6</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>6</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>7</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>7</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>8</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>8</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mtext>9</mml:mtext>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mtext>9</mml:mtext>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mrow>
								<mml:mtext>10</mml:mtext>
							</mml:mrow>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mrow>
								<mml:mtext>10</mml:mtext>
							</mml:mrow>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:msub>
							<mml:mtext>X</mml:mtext>
							<mml:mrow>
								<mml:mtext>11</mml:mtext>
							</mml:mrow>
						</mml:msub>
						<mml:msub>
							<mml:mtext>&#x03B2;</mml:mtext>
							<mml:mrow>
								<mml:mtext>11</mml:mtext>
							</mml:mrow>
						</mml:msub>
						<mml:mo>+</mml:mo>
						<mml:mtext>&#x03B5;</mml:mtext>
					</mml:mrow>
				</mml:math>
			</disp-formula>
			<p>
				<xref ref-type="table" rid="ch1">Chart 1</xref> summarizes how the variables were measured.
			</p>
			<table-wrap id="ch1">
				<label>Chart 1</label>
				<caption>
					<title>Variables measurement level</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="50%">
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">Variables</th>
							<th align="left" valign="top">Description</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">Sex (Control)</td>
							<td align="left" valign="top">
								<italic>Dummy:</italic> Female (0); Male (1)
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">Age (Control)</td>
							<td align="left" valign="top">Continuous: age at election.</td>
						</tr>
						<tr>
							<td align="left" valign="top">Education (Control)</td>
							<td align="left" valign="top">Categorical ordinal: Read and write (0); Elementary School incomplete (1); Elementary School complete (2); High School incomplete (3); High School complete (4); Tertiary education incomplete (5); Tertiary Education (6).</td>
						</tr>
						<tr>
							<td align="left" valign="top">Poverty (Control)</td>
							<td align="left" valign="top">Continuous: percentage of poor people in the state.</td>
						</tr>
						<tr>
							<td align="left" valign="top">Ideology (Control)</td>
							<td align="left" valign="top">Categorical: Left (0); Center (1); Right (2).</td>
						</tr>
						<tr>
							<td align="left" valign="top">Vote Increase 2006 (Control)</td>
							<td align="left" valign="top">
								<italic>Dummy</italic>: Increased (1); Lowered (0).
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">Change (Control)</td>
							<td align="left" valign="top">
								<italic>Dummy</italic>: Changed parties (1); Did not (0).
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">Pork (Control)</td>
							<td align="left" valign="top">Continuous: success rate of execution of parliamentary amendments.</td>
						</tr>
						<tr>
							<td align="left" valign="top">Seats per state (Control)</td>
							<td align="left" valign="top">Continuous: number of seats for each state at the Chamber of Deputies.</td>
						</tr>
						<tr>
							<td align="left" valign="top">Expenditures (Control)</td>
							<td align="left" valign="top">Continuous: campaign expenditures</td>
						</tr>
						<tr>
							<td align="left" valign="top">Scandal (IV)</td>
							<td align="left" valign="top">
								<italic>Dummy:</italic> Involved in a scandal (1); Not involved in a scandal (0).
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">Reelection (DV)</td>
							<td align="left" valign="top">
								<italic>Dummy:</italic> Reelected (1); Not-reelected (0).
							</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: the authors, based on Castro and Nunes (2014, p. 38-40).</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>We will test three hypotheses:</p>
			<p>
				<italic>H</italic>
				<sub>
					<italic>1</italic>
				</sub>: being involved in a corruption scandal reduces the probability of reelection;
			</p>
			<p>
				<italic>H</italic>
				<sub>
					<italic>2</italic>
				</sub>: the higher campaign spending, the higher the probability of reelection;
			</p>
			<p>
				<italic>H</italic>
				<sub>
					<italic>3</italic>
				</sub>: the higher the execution of amendments, the higher the probability of reelection.
			</p>
		</sec>
		<sec>
			<title>V. Results</title>
			<p>The first step is to analyze the distribution of the dependent variable. 
				<xref ref-type="table" rid="t3">Table 3</xref> summarizes this information.
			</p>
			<table-wrap id="t3">
				<label>Table 3</label>
				<caption>
					<title>Frequency distribution for the independent variable (reelected)</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="33%">
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">Reelected</th>
							<th align="center" valign="top">N</th>
							<th align="center" valign="top">%</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">Yes</td>
							<td align="center" valign="top">273</td>
							<td align="center" valign="top">60.53</td>
						</tr>
						<tr>
							<td align="left" valign="top">No</td>
							<td align="center" valign="top">178</td>
							<td align="center" valign="top">39.47</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Total</bold>
							</td>
							<td align="center" valign="top">
								<bold>451</bold>
							</td>
							<td align="center" valign="top">
								<bold>100.0</bold>
							</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>There is information for 451 cases. From this total, 60.53% of the federal deputies were reelected in 2006, which means 273 occurrences
				<xref ref-type="fn" rid="fn23">
					<sup>23</sup>
				</xref>. We can say then that the probability for reelection is of 0.605. Alternatively, the chance of being reelected can be calculated by the division between the probabilities (yes/no), here, 0.605/0.395 = 1.53. 
				<xref ref-type="table" rid="t4">Table 4</xref> illustrates this information.
			</p>
			<table-wrap id="t4">
				<label>Table 4</label>
				<caption>
					<title>Comparison of reelection rate (involved x not-involved) (%)</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="25%">
						<col/>
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">Involved in a scandal</th>
							<th align="center" valign="top" colspan="2" style="border-bottom: thin solid;">Reelected</th>
							<th align="center" valign="top">Total</th>
						</tr>
						<tr>
							<th align="left" valign="top"></th>
							<th align="center" valign="top">Yes</th>
							<th align="center" valign="top">No</th>
							<th align="center" valign="top"></th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">
								<bold>Yes</bold>
							</td>
							<td align="center" valign="top">10 (17.86)</td>
							<td align="center" valign="top">46 (82.14)</td>
							<td align="center" valign="top">
								<bold>56 (100.0)</bold>
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>No</bold>
							</td>
							<td align="center" valign="top">263 (66.58)</td>
							<td align="center" valign="top">132 (33.42)</td>
							<td align="center" valign="top">
								<bold>395 (100.0)</bold>
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Total</bold>
							</td>
							<td align="center" valign="top">
								<bold>273 (60.53)</bold>
							</td>
							<td align="center" valign="top">
								<bold>178 (39.47)</bold>
							</td>
							<td align="center" valign="top">
								<bold>451 (100.0)</bold>
							</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>Considering only candidates involved in corruption scandals, the reelection rate was 17.86%, since 10 out of 56 representatives got a new term
				<xref ref-type="fn" rid="fn24">
					<sup>24</sup>
				</xref>. This means that, for this group, the probability for reelection is 0.179 and the chance for reelection is 0.22. For the candidates not involved in corruption scandals, the chance of being reelected is 1.9. Ultimately, in our replication example, the logistic regression consists of the comparative analysis of the reelection percentage of candidates involved in corruption scandals and those not involved
				<xref ref-type="fn" rid="fn25">
					<sup>25</sup>
				</xref>.
			</p>
			<p>In terms of the model&#x2019;s general fit, one of the main tests used is the 
				<xref ref-type="bibr" rid="B31">Hosmer and Lemeshow (2000)</xref>. This test is considered more robust than a common chi-square, especially when there are continuous independent variables or when the sample&#x2019;s size is small (
				<xref ref-type="bibr" rid="B22">Garson, 2011</xref>). 
				<xref ref-type="table" rid="t5">Table 5</xref> summarizes the information of interest (value of the test, degrees of freedom, and statistical significance) for Hosmer and Lemeshow tests, and 
				<xref ref-type="table" rid="t6">Table 6</xref> shows the same for the Omnibus test of model coefficients.
			</p>
			<table-wrap id="t5">
				<label>Table 5</label>
				<caption>
					<title>Hosmer and Lemeshow Test</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="33%">
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">&#x03C7;
								<sup>2</sup>
							</th>
							<th align="center" valign="top">gl</th>
							<th align="center" valign="top">Sig</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">6.832</td>
							<td align="center" valign="top">8</td>
							<td align="center" valign="top">0.555</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
				</table-wrap-foot>
			</table-wrap>
			<table-wrap id="t6">
				<label>Table 6</label>
				<caption>
					<title>Omnibus test of model coefficients</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="33%">
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">&#x03C7;
								<sup>2</sup>
							</th>
							<th align="center" valign="top">gl</th>
							<th align="center" valign="top">Sig</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">56.356</td>
							<td align="center" valign="top">11</td>
							<td align="center" valign="top">0.000</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>A non-significant result (p &gt; 0.05) suggests that the model estimated with the independent variables is better than the null model. The estimated model has a chi-square (&#x03C7;
				<sup>2</sup>) of 6.832 and a p-valor of 0.555, suggesting an adequate fit. Another commonly used adjustment measure is the Omnibus test of model coefficients. It is a chi-square test comparing the model&#x2019;s variance with the independent variables and the null model (just the intercept).
			</p>
			<p>Unlike the Hosmer and Lemeshow test, a significant result (p &lt; 0.05) suggests an adequate fit. According to the data, the model has a chi-square of 56.356 (p-value &lt; 0.001), that is, the fitted model is better than the null model. The, we should conclude that the independent variables influence the dependent variable&#x2019;s variation
				<xref ref-type="fn" rid="fn26">
					<sup>26</sup>
				</xref>. We do not find these tests in Castro and Nunes&#x2019;s paper (2014), nor the computational scripts. 
				<xref ref-type="table" rid="t7">Table 7</xref> summarizes the coefficients estimated by the logistic regression model in an attempt to reproduce the results reported in 
				<xref ref-type="table" rid="t6">Table 6</xref> of 
				<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref>.
			</p>
			<table-wrap id="t7">
				<label>Table 7</label>
				<caption>
					<title>Logistic regression model coefficients
						<xref ref-type="table-fn" rid="TFN2">*</xref>
					</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="14%">
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top"></th>
							<th align="center" valign="top">&#x03B2;</th>
							<th align="center" valign="top">Standard error</th>
							<th align="center" valign="top">Z(Wald)</th>
							<th align="center" valign="top">Sig.</th>
							<th align="center" valign="top">Exp(&#x03B2;)</th>
							<th align="center" valign="top">(exp(&#x03B2;)-1) x 100</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">(Intercept)</td>
							<td align="center" valign="top">0.552</td>
							<td align="center" valign="top">1.568</td>
							<td align="center" valign="top">0.352</td>
							<td align="center" valign="top">0.725</td>
							<td align="center" valign="top">1.737</td>
							<td align="center" valign="top">73.734</td>
						</tr>
						<tr>
							<td align="left" valign="top">Poverty</td>
							<td align="center" valign="top">1.171</td>
							<td align="center" valign="top">1.419</td>
							<td align="center" valign="top">0.825</td>
							<td align="center" valign="top">0.409</td>
							<td align="center" valign="top">3.224</td>
							<td align="center" valign="top">222.386</td>
						</tr>
						<tr>
							<td align="left" valign="top">Male</td>
							<td align="center" valign="top">-0.005</td>
							<td align="center" valign="top">0.560</td>
							<td align="center" valign="top">-0.009</td>
							<td align="center" valign="top">0.993</td>
							<td align="center" valign="top">0.995</td>
							<td align="center" valign="top">-0.484</td>
						</tr>
						<tr>
							<td align="left" valign="top">Age</td>
							<td align="center" valign="top">-0.014</td>
							<td align="center" valign="top">0.017</td>
							<td align="center" valign="top">-0.830</td>
							<td align="center" valign="top">0.406</td>
							<td align="center" valign="top">0.986</td>
							<td align="center" valign="top">-1.409</td>
						</tr>
						<tr>
							<td align="left" valign="top">Education</td>
							<td align="center" valign="top">-0.060</td>
							<td align="center" valign="top">0.161</td>
							<td align="center" valign="top">-0.370</td>
							<td align="center" valign="top">0.712</td>
							<td align="center" valign="top">0.942</td>
							<td align="center" valign="top">-5.789</td>
						</tr>
						<tr>
							<td align="left" valign="top">Ideology</td>
							<td align="center" valign="top">-0.125</td>
							<td align="center" valign="top">0.224</td>
							<td align="center" valign="top">-0.561</td>
							<td align="center" valign="top">0.575</td>
							<td align="center" valign="top">0.882</td>
							<td align="center" valign="top">-11.782</td>
						</tr>
						<tr>
							<td align="left" valign="top">Vote Increase</td>
							<td align="center" valign="top">0.908</td>
							<td align="center" valign="top">0.341</td>
							<td align="center" valign="top">2.663</td>
							<td align="center" valign="top">0.008</td>
							<td align="center" valign="top">2.480</td>
							<td align="center" valign="top">148.030</td>
						</tr>
						<tr>
							<td align="left" valign="top">Change</td>
							<td align="center" valign="top">0.078</td>
							<td align="center" valign="top">0.382</td>
							<td align="center" valign="top">0.205</td>
							<td align="center" valign="top">0.838</td>
							<td align="center" valign="top">1.081</td>
							<td align="center" valign="top">8.136</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Parlamentary amendments</bold>
							</td>
							<td align="center" valign="top">
								<bold>-0.272</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.639</bold>
							</td>
							<td align="center" valign="top">
								<bold>-0.425</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.671</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.762</bold>
							</td>
							<td align="center" valign="top">
								<bold>-23.785</bold>
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">Candidate/seats</td>
							<td align="center" valign="top">-0.005</td>
							<td align="center" valign="top">0.009</td>
							<td align="center" valign="top">-0.516</td>
							<td align="center" valign="top">0.606</td>
							<td align="center" valign="top">0.995</td>
							<td align="center" valign="top">-0.469</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Campaign spending</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.000</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.000</bold>
							</td>
							<td align="center" valign="top">
								<bold>3.920</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.000</bold>
							</td>
							<td align="center" valign="top">
								<bold>1.000</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.000</bold>
							</td>
						</tr>
						<tr>
							<td align="left" valign="top">
								<bold>Scandal</bold>
							</td>
							<td align="center" valign="top">
								<bold>-1.677</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.528</bold>
							</td>
							<td align="center" valign="top">
								<bold>-3.176</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.001</bold>
							</td>
							<td align="center" valign="top">
								<bold>0.187</bold>
							</td>
							<td align="center" valign="top">
								<bold>-81.299</bold>
							</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
					<fn id="TFN1">
						<p>Dependent variable: reelected.</p>
					</fn>
					<fn id="TFN2">
						<label>
							<sup>*</sup>
						</label>
						<p>As with any regression model, the unstandardized coefficients of variables in different scales cannot be directly compared. STATA has a command (listcoef, std help) which produces standardized coefficients in the independent, dependent, and both variables. 
							<xref ref-type="bibr" rid="B48">Menard (2004)</xref> presents six different ways to standardize coefficients in a logistic regression.
						</p>
					</fn>
				</table-wrap-foot>
			</table-wrap>
			<p>As with a linear regression, the first step is to analyze the estimated coefficients (&#x03B2;). Here, the research must observe the sign of the estimates and compare them with the direction expected in their hypotheses. X
				<sub>11</sub> (Scandal) has a negative effect (-1.677) on the probability of reelection. Unlike a linear model, logistic regression coefficients does not have an direct interpretation.
			</p>
			<p>There are two main ways of reading the coefficients: a) analyze the odds ratio and b) turn the odds ratio into a percentage. With the former, we conclude that involvement in corruption scandals reduces the chances of being elected. In terms of percentages, being involved in corruption diminishes in 81.2% the probability of being reelected, as theoretically expected by hypothesis 1. When considering campaign expenses, the effect was null, with an Exp (&#x03B2;) = 1.000.</p>
			<p>As in 
				<xref ref-type="bibr" rid="B4">Castro and Nunes (2014)</xref>, we did not find significant effects of the parliamentary amendment variable on the chance of reelection, considering the magnitude of the p-value and the standard error twice as large as the estimate of the impact itself
				<xref ref-type="fn" rid="fn27">
					<sup>27</sup>
				</xref>.
			</p>
			<p>After analyzing the coefficients associated with the variables of interest, the next step is to evaluate the quality of the model&#x2019;s fit. 
				<xref ref-type="table" rid="t8">Table 8</xref> summarizes some goodness-of-fit measures typically reported in models estimated by the maximum likelihood
				<xref ref-type="fn" rid="fn28">
					<sup>28</sup>
				</xref>.
			</p>
			<table-wrap id="t8">
				<label>Table 8</label>
				<caption>
					<title>Model goodness-of-fit measures
						<xref ref-type="table-fn" rid="TFN3">*</xref>
					</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="20%">
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top">-2log likelihood null</th>
							<th align="center" valign="top">-2log likelihood</th>
							<th align="center" valign="top">Cox &amp; Snell R
								<sup>2</sup>
							</th>
							<th align="center" valign="top">Nagelkerke R
								<sup>2</sup>
							</th>
							<th align="center" valign="top">BIC</th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">3,057,559</td>
							<td align="center" valign="top">237,4225</td>
							<td align="center" valign="top">0.229</td>
							<td align="center" valign="top">0.308</td>
							<td align="center" valign="top">301,891</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
					<fn id="TFN3">
						<label>
							<sup>*</sup>
						</label>
						<p>The - 2 log likelihood (-2LL) statistic is a fit measure. The smaller it is, the better the fit. The researcher may use it to compare the fit of different models (including and removing independent variables, but keeping the same dependent variable).</p>
					</fn>
				</table-wrap-foot>
			</table-wrap>
			<p>It is common for statistical packages to show in the output the number of iterations used by the computer to estimate the model. Informing that the model converged after iteration 5 means that the coefficients were estimated via maximum likelihood. Generally, the faster a model converges (less iterations), the better. If the model does not converge, the coefficients are unreliable. One of the main factors that explain a model&#x2019;s non-convergence is the insufficiency of cases in relation the number of independent variables included in the model.</p>
			<p>According to 
				<xref ref-type="bibr" rid="B47">Menard (2002)</xref>, the log likelihood is a measure of parameter selection in the logistic regression model. However, most statistical packages report the -2 log likelihood (-2LL) and its interpretation is as follows: the larger it is, the worse is the model&#x2019;s explanatory/predictive capacity. Intuitively, it can be interpreted as a measure of the error when trying to use a determined set of independent variables (model) to explain the dependent variable&#x2019;s variation. The researcher can request the iteration history of the estimation. The procedure will produce the -2 log likelihood of the null and the fitted models. The difference between them is measured with a chi-square. As it is an error measure, the larger the chi-square, the larger is the error reduction of the fitted model (with the independent variables), in relation to the null model.
			</p>
			<p>
				<xref ref-type="table" rid="t8">Table 8</xref> presents the value of -2LL to make comparing the models easier. In the null model the -2LL was 3,057,559 and the model with independent variables was 237,4225. In this case, we observe a considerable reduction. This means that the model with the independent variables has a superior fit to the null model. Similarly, the BIC (Bayesian Information Criterion) is another measure based on maximum likelihood. The smaller, the better. The model tested has a BIC of 301.891, while the null model&#x2019;s was 3,066.105. We can extrapolate that and compare several models, not just the null model.
			</p>
			<p>Unlike the linear model, a logistic regression does not have a synthetic measure of the variation in the dependent variable explained by the model, such as the coefficient of determination
				<xref ref-type="fn" rid="fn29">
					<sup>29</sup>
				</xref>. However, some measures were developed to guide the researcher regarding the explanatory/predictive power of the model
				<xref ref-type="fn" rid="fn30">
					<sup>30</sup>
				</xref>. The most commonly used are Cox &amp; Snell&#x2019;s pseudo R
				<sup>2</sup> of and Nagelkerke&#x2019;s
				<xref ref-type="fn" rid="fn31">
					<sup>31</sup>
				</xref> pseudo R
				<sup>2</sup>. For 
				<xref ref-type="bibr" rid="B47">Menard (2002)</xref>,
			</p>
			<disp-quote>
				<p>
					<italic>R</italic>
					<sub>
						<italic>i</italic>
					</sub>
					<xref ref-type="fn" rid="fn2">
						<sup>2</sup>
					</xref> is a proportional reduction in -2LL or a proportional reduction in the absolute value of the log-likelihood measure, where () the quantity being minimized to select the model parameters &#x2013; is taken as a measure of &#x2018;variation&#x2019;(
					<xref ref-type="bibr" rid="B46">Menard, 2002</xref>, p. 25).
				</p>
			</disp-quote>
			<p>For the purposes of this paper, we adopted the following interpretation: the closer to zero, the smaller is the difference between then null model (without any independent variables) and the estimated model. The closer to one, the larger is the difference between the null model and model proposed by the research. At an extreme, a pseudo R
				<sup>2</sup> of zero indicates that the independent variables included do not help to explain the variation of the dependent variable. A pseudo R
				<sup>2</sup> of 1 suggests that the variables explain/predict the variation in Y perfectly. Keeping in mind that we should be less demanding of a logistic model than a linear model in terms of variance explained by the R
				<sup>2</sup>.
			</p>
			<p>Lastly, a researcher must analyze the classification table. This report is particularly interesting because it gives a measure of the model&#x2019;s predictive capacity. 
				<xref ref-type="table" rid="t9">Table 9</xref> illustrates the information of interest.
			</p>
			<table-wrap id="t9">
				<label>Table 9</label>
				<caption>
					<title>Classification table</title>
				</caption>
				<table frame="hsides" rules="groups">
					<colgroup width="20%">
						<col/>
						<col/>
						<col/>
						<col/>
						<col/>
					</colgroup>
					<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
						<tr>
							<th align="left" valign="top"></th>
							<th align="center" valign="top"></th>
							<th align="center" valign="top" colspan="2" style="border-bottom: thin solid;">Predicted</th>
							<th align="center" valign="top">Total</th>
						</tr>
						<tr>
							<th align="left" valign="top"></th>
							<th align="center" valign="top"></th>
							<th align="center" valign="top">Not reelected</th>
							<th align="center" valign="top">Reelected</th>
							<th align="center" valign="top"></th>
						</tr>
					</thead>
					<tbody style="border-bottom: thin solid; border-color: #000000">
						<tr>
							<td align="left" valign="top">
								<bold>Real</bold>
							</td>
							<td align="center" valign="top">
								<bold>Not reelected</bold>
							</td>
							<td align="center" valign="top">23.50</td>
							<td align="center" valign="top">17.51</td>
							<td align="center" valign="top">41.01</td>
						</tr>
						<tr>
							<td align="left" valign="top"></td>
							<td align="center" valign="top">
								<bold>Reelected</bold>
							</td>
							<td align="center" valign="top">10.60</td>
							<td align="center" valign="top">48.39</td>
							<td align="center" valign="top">58.99</td>
						</tr>
						<tr>
							<td align="left" valign="top"></td>
							<td align="center" valign="top">
								<bold>Total</bold>
							</td>
							<td align="center" valign="top">34.10</td>
							<td align="center" valign="top">65.90</td>
							<td align="center" valign="top">100.00</td>
						</tr>
					</tbody>
				</table>
				<table-wrap-foot>
					<attrib>Source: The authors.</attrib>
				</table-wrap-foot>
			</table-wrap>
			<p>The classification table is frequently referred to as a confusion table. For 
				<xref ref-type="bibr" rid="B22">Garson (2011)</xref>,
			</p>
			<disp-quote>
				<p>Although classification hit rates (percent correct) as overall effect size measures are preferred over pseudo-R
					<sup>2</sup> measures, they to have some severe limitations for this purpose. Classification tables should not be used exclusively as goodness-of-fit measures because they ignore actual predicted probabilities and instead use dichotomized predictions based on a cutoff (ex.: 0.50). For instance, in binary logistic regression, predicting a 0-or-1 dependent, the classification table does not reveal how close to 1.0 the correct predictions were nor how close to 0.0 the errors were. A model in which the predictions, correct or not, were mostly close to the .50 cutoff does not have as good a fit as a model where the predicted scores cluster either near 1.0 or 0.0. Also, because the hit rate can vary markedly by sample for the same logistic model, use of the classification table to compare across samples is not recommended. (
					<xref ref-type="bibr" rid="B22">Garson, 2011</xref>, p. 173).
				</p>
			</disp-quote>
			<p>Our classification matrix uses the conventional standard of 50% to allocate cases as 1 (if the predicted probability is higher than 0.5) or 0 (smaller than 0.5). We can evaluate this table using three concepts: accuracy, sensibility, and specificity. The accuracy of the model is the proportion of true positive and true negative cases. According to 
				<xref ref-type="table" rid="t9">Table 9</xref>, the accuracy of our model was of 71.89% (23.50% + 48.29%). However, the accuracy of a model is not always the most important aspect. In certain cases, what is important is maximizing the rate of true positives or true negatives.
			</p>
			<p>Moving on to sensibility. It is the percentage of cases that has the feature of interest (was reelected) that were accurately predicted by the model (true positives / false positives + true positives). In our example, 48.39% of reelected candidates were correctly classified, out of a total of 58.99% that were actually reelected. This gives us a sensibility of 82.03% (48.39%/58.99%). The specificity of the model is the percentage of cases that do not have the feature of interest (were not reelected), that were correctly classified by the model, that is (true negatives / false negatives + true negatives). As we can see, 23.50% of non-reelected candidates were correctly identified out of a total of 41.01% of non-reelected. This gives us a specificity of 57.30% (23.50%/41.01%). There is a trade-off between sensibility and specificity. When increasing one, the other diminishes. Although sometimes the sensibility of the model is more important (predicting an illness, since one would be able to treat it), at other times it is best to increase specificity (keep corrupt politicians from being elected).</p>
		</sec>
		<sec>
			<title>VI. Conclusion</title>
			<p>We hope to help students and teachers to better understand how logistic regression works. The absence of calculus, linear and matrix algebra, and advanced statistics limits our ability to understand more advanced data analysis techniques. For this reason, our approach focused on the intuitive exposition of results. We also believe that understanding the intuitive logic of logistic regression is the first step to better understanding the different procedures that exist to deal with categorical data. Computational advances allow researchers with less specific training in Mathematics and Statistics to benefit from the advantages associated with the different multivariate techniques. Given that many variables in Political Science are categorical, the analytical benefits associated with the correct application and interpretation of a logistic model are evident. With this paper, we hope to disseminate the use of logistic regression.</p>
			<p>And how to improve the quality of methodological and technical training offered to Political Science undergraduate and graduate students in Brazil? We recommend the following: (1) incorporate of replication as a pedagogical tool in data analysis disciplines; (2) mandatory disciplines on mathematics, calculus, probability, and statistics in undergraduate and graduate curricula. In addition, students must receive training in some programming language; (3) conduct practical exercises involving data analysis with topics typical of Political Science. The emphasis onABSTRACT problems reduces students&#x2019; interests on the topic; (4) incentivize student participation in winter/summer courses such as MQ-UFMG and IPSA-USP; (5) promote epistemology and philosophy of science disciplines. The definition of research methods and techniques depend on the epistemological view of what is scientific knowledge and how it should be implemented; (6) diffuse critical reading of papers that use advanced data analysis techniques; (7) keep up with the academic production of journals specialized in methodology such as, for instance, 
				<italic>Political Analysis</italic> and 
				<italic>Political Science Research and Methods</italic>; (8) encourage the publication of methodological papers in national journals; (9) foster the creation of research groups and round-tables on methodology and data analysis techniques in professional conferences; (10) fund research projects especially devoted to deepening the knowledge on the main feature of science: method.
			</p>
		</sec>
	</body>
	<back>
		<ref-list>
			<title>References</title>
			<ref id="B1">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Altman</surname>
							<given-names>D.</given-names>
						</name>
					</person-group>
					<year>1991</year>
					<article-title>Categorising continuous variables</article-title>
					<source>British Journal of Cancer</source>
					<volume>64</volume>
					<issue>5</issue>
					<fpage>975</fpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1038/bjc.1991.441</pub-id>
				</element-citation>
				<mixed-citation>Altman, D. (1991) Categorising continuous variables. 
					<italic>British Journal of Cancer</italic>, 64(5), p. 975. DOI: 10.1038/bjc.1991.441
				</mixed-citation>
			</ref>
			<ref id="B2">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Bonney</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>1987</year>
					<article-title>Logistic regression for dependent binary observations</article-title>
					<source>Biometrics</source>
					<volume>43</volume>
					<issue>4</issue>
					<fpage>951</fpage>
					<lpage>973</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2531548</pub-id>
				</element-citation>
				<mixed-citation>Bonney, G. (1987) Logistic regression for dependent binary observations. 
					<italic>Biometrics</italic>, 43(4), pp. 951-973. DOI: 10.2307/2531548
				</mixed-citation>
			</ref>
			<ref id="B3">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Brant</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>1996</year>
					<article-title>Digesting logistic regression results</article-title>
					<source>The American Statistician</source>
					<volume>50</volume>
					<issue>2</issue>
					<fpage>117</fpage>
					<lpage>119</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2684422</pub-id>
				</element-citation>
				<mixed-citation>Brant, R. (1996) Digesting logistic regression results. 
					<italic>The American Statistician</italic>, 50(2), pp. 117-119. DOI: 10.2307/2684422
				</mixed-citation>
			</ref>
			<ref id="B4">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Castro</surname>
							<given-names>M.M.M.</given-names>
						</name>
						<name>
							<surname>Nunes</surname>
							<given-names>F.</given-names>
						</name>
					</person-group>
					<year>2014</year>
					<article-title>Candidatos corruptos s&#x00E3;o punidos?: accountability na elei&#x00E7;&#x00E3;o brasileira de 2006</article-title>
					<source>Opini&#x00E3;o P&#x00FA;blica</source>
					<volume>20</volume>
					<issue>1</issue>
					<fpage>26</fpage>
					<lpage>48</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1590/S0104-62762014000100002</pub-id>
				</element-citation>
				<mixed-citation>Castro, M.M.M. &amp; Nunes, F. (2014) Candidatos corruptos s&#x00E3;o punidos?: accountability na elei&#x00E7;&#x00E3;o brasileira de 2006. 
					<italic>Opini&#x00E3;o P&#x00FA;blica</italic>, 20(1), pp. 26-48. DOI: 10.1590/S0104-62762014000100002
				</mixed-citation>
			</ref>
			<ref id="B5">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Codato</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Cervi</surname>
							<given-names>E.</given-names>
						</name>
						<name>
							<surname>Perissinoto</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<article-title>Quem se elege prefeito no Brasil? Condicionantes do sucesso eleitoral em 2012</article-title>
					<source>Cadernos Adenauer</source>
					<volume>14</volume>
					<issue>2</issue>
					<fpage>61</fpage>
					<lpage>84</lpage>
				</element-citation>
				<mixed-citation>Codato, A.; Cervi, E. &amp; Perissinoto, R. (2013) Quem se elege prefeito no Brasil? Condicionantes do sucesso eleitoral em 2012. 
					<italic>Cadernos Adenauer</italic>, 14(2), pp. 61-84.
				</mixed-citation>
			</ref>
			<ref id="B6">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Cohen</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>1983</year>
					<article-title>The Cost of Dichotomization</article-title>
					<source>Applied Psychological Measurement</source>
					<volume>7</volume>
					<issue>3</issue>
					<fpage>249</fpage>
					<lpage>253</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1177/014662168300700301</pub-id>
				</element-citation>
				<mixed-citation>Cohen, J. (1983) The Cost of Dichotomization. 
					<italic>Applied Psychological Measurement</italic>, 7(3), pp. 249&#x2013;253. DOI: 10.1177/014662168300700301
				</mixed-citation>
			</ref>
			<ref id="B7">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Cook</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Weisberg</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>1997</year>
					<article-title>Graphics for assessing the adequacy of regression models</article-title>
					<source>Journal of the American Statistical Association</source>
					<volume>92</volume>
					<issue>438</issue>
					<fpage>490</fpage>
					<lpage>499</lpage>
					<comment>DOI:</comment>
					<pub-id pub-id-type="doi">10.1080/01621459.1997.10474002</pub-id>
				</element-citation>
				<mixed-citation>Cook, R. &amp; Weisberg, S. (1997) Graphics for assessing the adequacy of regression models. 
					<italic>Journal of the American Statistical Association</italic>, 92(438), pp. 490-499. DOI:10.1080/01621459.1997.10474002
				</mixed-citation>
			</ref>
			<ref id="B8">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<collab>R Core Team</collab>
					</person-group>
					<year>2019</year>
					<article-title>R: A language and environment for statistical computing</article-title>
					<source>R Foundation for Statistical Computing</source>
					<publisher-loc>Vienna, Austria</publisher-loc>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://www.R-project.org/">https://www.R-project.org/</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>28 set. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>R Core Team. (2019) R: A language and environment for statistical computing. 
					<italic>R Foundation for Statistical Computing</italic>. Vienna, Austria. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://www.R-project.org/">https://www.R-project.org/</ext-link>. Acesso em: 28 set. 2020.
				</mixed-citation>
			</ref>
			<ref id="B9">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>DeMaris</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<year>1995</year>
					<article-title>A tutorial in logistic regression</article-title>
					<source>Journal of Marriage and the Family</source>
					<fpage>956</fpage>
					<lpage>968</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/353415</pub-id>
				</element-citation>
				<mixed-citation>DeMaris, A. (1995) A tutorial in logistic regression. 
					<italic>Journal of Marriage and the Family</italic>, pp. 956-968. DOI: 10.2307/353415
				</mixed-citation>
			</ref>
			<ref id="B10">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Eno</surname>
							<given-names>D.</given-names>
						</name>
						<name>
							<surname>Terrell</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>1999</year>
					<article-title>Scatterplots for logistic regression</article-title>
					<source>Journal of Computational and Graphical Statistics</source>
					<volume>8</volume>
					<issue>3</issue>
					<fpage>413</fpage>
					<lpage>425</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1080/10618600.1999.10474822</pub-id>
				</element-citation>
				<mixed-citation>Eno, D. &amp; Terrell, G. (1999) Scatterplots for logistic regression. 
					<italic>Journal of Computational and Graphical Statistics</italic>, 8(3), pp. 413-425. DOI: 10.1080/10618600.1999.10474822
				</mixed-citation>
			</ref>
			<ref id="B11">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Epstein</surname>
							<given-names>L.</given-names>
						</name>
						<name>
							<surname>Landes</surname>
							<given-names>W.</given-names>
						</name>
						<name>
							<surname>Posner</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<source>The behavior of federal judges: a theoretical and empirical study of rational choice</source>
					<publisher-loc>Cambridge</publisher-loc>
					<publisher-name>Harvard University Press</publisher-name>
				</element-citation>
				<mixed-citation>Epstein, L.; Landes, W. &amp; Posner, R. (2013) 
					<italic>The behavior of federal judges: a theoretical and empirical study of rational choice</italic>. Cambridge: Harvard University Press.
				</mixed-citation>
			</ref>
			<ref id="B12">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Fernandes</surname>
							<given-names>A.</given-names>
						</name>
						<etal/>
					</person-group>
					<year>2019</year>
					<article-title>Why quantitative variables should not be recoded as categorical</article-title>
					<source>Journal of Applied Mathematics and Physics</source>
					<volume>7</volume>
					<issue>7</issue>
					<fpage>1519</fpage>
					<lpage>1530</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.4236/jamp.2019.77103</pub-id>
				</element-citation>
				<mixed-citation>Fernandes, A. 
					<italic>et al.</italic> (2019) Why quantitative variables should not be recoded as categorical. 
					<italic>Journal of Applied Mathematics and Physics</italic>, 7(7), pp. 1519-1530. DOI: 10.4236/jamp.2019.77103
				</mixed-citation>
			</ref>
			<ref id="B13">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<name>
							<surname>Silva</surname>
							<given-names>L.</given-names>
						</name>
						<name>
							<surname>Domingos</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<year>2015</year>
					<article-title>O Que &#x00E9; e como Superar a Multicolinearidade? Um Guia Para Ci&#x00EA;ncia Pol&#x00ED;tica</article-title>
					<source>Conex&#x00E3;o Pol&#x00ED;tica</source>
					<volume>4</volume>
					<issue>2</issue>
					<fpage>95</fpage>
					<lpage>104</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.26694/rcp.issn.2317-3254.v4e2.2015.p95-104</pub-id>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D.; Silva, L. &amp; Domingos, A. (2015) O Que &#x00E9; e como Superar a Multicolinearidade? Um Guia Para Ci&#x00EA;ncia Pol&#x00ED;tica. 
					<italic>Conex&#x00E3;o Pol&#x00ED;tica</italic>, 4(2), pp. 95-104. DOI: 10.26694/rcp.issn.2317-3254.v4e2.2015.p95-104
				</mixed-citation>
			</ref>
			<ref id="B14">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<name>
							<surname>Silva</surname>
							<given-names>J.</given-names>
							<suffix>J&#x00FA;nior</suffix>
						</name>
					</person-group>
					<year>2016</year>
					<article-title>O outlier que pertuba o seu sono: Como identificar casos extremos?</article-title>
					<comment>In: </comment>
					<source>10&#x00BA; Encontro da Associa&#x00E7;&#x00E3;o Brasileira de Ci&#x00EA;ncia Pol&#x00ED;tica</source>
					<publisher-loc>Belo Horizonte</publisher-loc>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf">https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>13 out. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D. &amp; Silva J&#x00FA;nior, J. (2016) O outlier que pertuba o seu sono: Como identificar casos extremos? In: 
					<italic>10&#x00BA; Encontro da Associa&#x00E7;&#x00E3;o Brasileira de Ci&#x00EA;ncia Pol&#x00ED;tica</italic>. Belo Horizonte. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf">https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf</ext-link>. Acesso em: 13 out. 2020.
				</mixed-citation>
			</ref>
			<ref id="B15">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<name>
							<surname>Silva</surname>
							<given-names>J.</given-names>
							<suffix>J&#x00FA;nior</suffix>
						</name>
					</person-group>
					<year>2010</year>
					<article-title>Desvendando os Mist&#x00E9;rios do Coeficiente de Correla&#x00E7;&#x00E3;o de Pearson (r)</article-title>
					<source>Revista Pol&#x00ED;tica Hoje</source>
					<volume>18</volume>
					<issue>1</issue>
					<fpage>115</fpage>
					<lpage>146</lpage>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://periodicos.ufpe.br/revistas/politicahoje/article/view/3852">https://periodicos.ufpe.br/revistas/politicahoje/article/view/3852</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>15 de maio 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D. &amp; Silva J&#x00FA;nior, J. (2010) Desvendando os Mist&#x00E9;rios do Coeficiente de Correla&#x00E7;&#x00E3;o de Pearson (r). 
					<italic>Revista Pol&#x00ED;tica Hoje.</italic> 18(1), pp. 115-146. Dispon&#x00ED;vel em: &lt;
					<ext-link ext-link-type="uri" xlink:href="https://periodicos.ufpe.br/revistas/politicahoje/article/view/3852">https://periodicos.ufpe.br/revistas/politicahoje/article/view/3852</ext-link>&gt;. Acesso em: 15 de maio 2020.
				</mixed-citation>
			</ref>
			<ref id="B16">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<etal/>
					</person-group>
					<year>2011</year>
					<article-title>O que fazer e o que n&#x00E3;o fazer com a regress&#x00E3;o: pressupostos e aplica&#x00E7;&#x00F5;es do modelo linear de M&#x00ED;nimos Quadrados Ordin&#x00E1;rios (MQO)</article-title>
					<source>Revista Pol&#x00ED;tica Hoje</source>
					<volume>20</volume>
					<issue>1</issue>
					<fpage>44</fpage>
					<lpage>99</lpage>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://periodicos.ufpe.br/revistas/politicahoje/article/view/3808/31622">https://periodicos.ufpe.br/revistas/politicahoje/article/view/3808/31622</ext-link>
					<date-in-citation>Acesso em: </date-in-citation>
					<date-in-citation>15 de maio 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D. 
					<italic>et al.</italic> (2011) O que fazer e o que n&#x00E3;o fazer com a regress&#x00E3;o: pressupostos e aplica&#x00E7;&#x00F5;es do modelo linear de M&#x00ED;nimos Quadrados Ordin&#x00E1;rios (MQO). 
					<italic>Revista Pol&#x00ED;tica Hoje</italic>, 20(1), pp. 44-99. Dispon&#x00ED;vel em: &lt;
					<ext-link ext-link-type="uri" xlink:href="https://periodicos.ufpe.br/revistas/politicahoje/article/view/3808/31622">https://periodicos.ufpe.br/revistas/politicahoje/article/view/3808/31622</ext-link>&gt;. Acesso em: 15 de maio 2020.
				</mixed-citation>
			</ref>
			<ref id="B17">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<etal/>
					</person-group>
					<year>2019</year>
					<article-title>Seven Reasons Why: A User&#x0027;s Guide to Transparency and Reproducibility</article-title>
					<source>Brazilian Political Science Review</source>
					<volume>13</volume>
					<issue>2</issue>
					<fpage>e0001</fpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1590/1981-3821201900020001</pub-id>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D. 
					<italic>et al.</italic> (2019) Seven Reasons Why: A User&#x0027;s Guide to Transparency and Reproducibility. 
					<italic>Brazilian Political Science Review</italic>, 13(2), pp. e0001. DOI: 10.1590/1981-3821201900020001
				</mixed-citation>
			</ref>
			<ref id="B18">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<name>
							<surname>Silva</surname>
							<given-names>J.</given-names>
							<suffix>J&#x00FA;nior</suffix>
						</name>
						<name>
							<surname>Rocha</surname>
							<given-names>E.</given-names>
						</name>
					</person-group>
					<year>2012</year>
					<article-title>Classificando regimes pol&#x00ED;ticos utilizando an&#x00E1;lise de conglomerados</article-title>
					<source>Opini&#x00E3;o P&#x00FA;blica</source>
					<volume>18</volume>
					<issue>1</issue>
					<fpage>109</fpage>
					<lpage>128</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1590/S0104-62762012000100006</pub-id>
				</element-citation>
				<mixed-citation>Figueiredo Filho, D.; Silva J&#x00FA;nior, J. &amp; Rocha, E. (2012) Classificando regimes pol&#x00ED;ticos utilizando an&#x00E1;lise de conglomerados. 
					<italic>Opini&#x00E3;o P&#x00FA;blica</italic>, 18(1), pp. 109-128. DOI: 10.1590/S0104-62762012000100006
				</mixed-citation>
			</ref>
			<ref id="B19">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Fox</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>1991</year>
					<source>Regression diagnostics: An introduction</source>
					<volume>79</volume>
					<publisher-loc>Thousand Oaks, CA</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
				</element-citation>
				<mixed-citation>Fox, J. (1991) 
					<italic>Regression diagnostics: An introduction</italic> Vol. 79. Thousand Oaks, CA: Sage Publications.
				</mixed-citation>
			</ref>
			<ref id="B20">
				<element-citation publication-type="thesis">
					<person-group person-group-type="author">
						<name>
							<surname>Freitas</surname>
							<given-names>L.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<source>Compara&#x00E7;&#x00E3;o das fun&#x00E7;&#x00F5;es de liga&#x00E7;&#x00E3;o logit e probit em regress&#x00E3;o bin&#x00E1;ria considerando diferentes tamanhos amostrais</source>
					<comment>Tese de Doutorado</comment>
					<publisher-loc>Vi&#x00E7;osa</publisher-loc>
					<institution content-type="institution">Universidade Federal de Vi&#x00E7;osa</institution>
				</element-citation>
				<mixed-citation>Freitas, L. (2013) 
					<italic>Compara&#x00E7;&#x00E3;o das fun&#x00E7;&#x00F5;es de liga&#x00E7;&#x00E3;o logit e probit em regress&#x00E3;o bin&#x00E1;ria considerando diferentes tamanhos amostrais</italic>. Tese de Doutorado. Vi&#x00E7;osa: Universidade Federal de Vi&#x00E7;osa.
				</mixed-citation>
			</ref>
			<ref id="B21">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Furlong</surname>
							<given-names>E.</given-names>
						</name>
					</person-group>
					<year>1998</year>
					<article-title>A logistic regression model explaining recent state casino gaming adoptions</article-title>
					<source>Policy Studies Journal</source>
					<volume>26</volume>
					<issue>3</issue>
					<fpage>371</fpage>
					<lpage>383</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1111/j.1541-0072.1998.tb01907.x</pub-id>
				</element-citation>
				<mixed-citation>Furlong, E. (1998) A logistic regression model explaining recent state casino gaming adoptions. 
					<italic>Policy Studies Journal</italic>, 
					<italic>26</italic>(3), pp. 371-383, DOI: 10.1111/j.1541-0072.1998.tb01907.x
				</mixed-citation>
			</ref>
			<ref id="B22">
				<element-citation publication-type="webpage">
					<person-group person-group-type="author">
						<name>
							<surname>Garson</surname>
							<given-names>G.D.</given-names>
						</name>
					</person-group>
					<year>2011</year>
					<article-title>Multiple regression: Overview</article-title>
					<source>Statnotes: Topics in Multivariate Analysis</source>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://faculty.chass.ncsu.edu/garson/PA765/statnote.htm">https://faculty.chass.ncsu.edu/garson/PA765/statnote.htm</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>13 set. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Garson, G.D. (2011) 
					<italic>Multiple regression: Overview</italic>. Statnotes: Topics in Multivariate Analysis. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://faculty.chass.ncsu.edu/garson/PA765/statnote.htm">https://faculty.chass.ncsu.edu/garson/PA765/statnote.htm</ext-link>. Acesso em: 13 set. 2020.
				</mixed-citation>
			</ref>
			<ref id="B23">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Garson</surname>
							<given-names>G.D.</given-names>
						</name>
					</person-group>
					<year>2014</year>
					<source>Logistic Regression: Binary and Multinomial</source>
					<publisher-loc>[s.l.]</publisher-loc>
					<publisher-name>Statistical Associates Publishing</publisher-name>
				</element-citation>
				<mixed-citation>Garson, G.D. (2014) 
					<italic>Logistic Regression: Binary and Multinomial</italic>. [s.l.]: Statistical Associates Publishing.
				</mixed-citation>
			</ref>
			<ref id="B24">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Goldsmith</surname>
							<given-names>B.</given-names>
						</name>
						<name>
							<surname>Chalup</surname>
							<given-names>S.</given-names>
						</name>
						<name>
							<surname>Quilan</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>2008</year>
					<article-title>Regime type and international conflict: towards a general model</article-title>
					<source>Journal of Peace Research</source>
					<volume>45</volume>
					<issue>6</issue>
					<fpage>743</fpage>
					<lpage>763</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1177/0022343308096154</pub-id>
				</element-citation>
				<mixed-citation>Goldsmith, B.; Chalup, S. &amp; Quilan, M. (2008) Regime type and international conflict: towards a general model. 
					<italic>Journal of Peace Research</italic>, 45(6), pp. 743-763. DOI: 10.1177/0022343308096154
				</mixed-citation>
			</ref>
			<ref id="B25">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Guthery</surname>
							<given-names>F.</given-names>
						</name>
						<name>
							<surname>Binghan</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>2007</year>
					<article-title>A primer on interpreting regression models</article-title>
					<source>The Journal of Wildlife Management</source>
					<volume>71</volume>
					<issue>3</issue>
					<fpage>684</fpage>
					<lpage>692</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2193/2006-285</pub-id>
				</element-citation>
				<mixed-citation>Guthery, F. &amp; Binghan, R. (2007) A primer on interpreting regression models. 
					<italic>The Journal of Wildlife Management</italic>, 71(3), pp. 684-692. DOI: 10.2193/2006-285
				</mixed-citation>
			</ref>
			<ref id="B26">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hagle</surname>
							<given-names>T.</given-names>
						</name>
						<name>
							<surname>Mitchell</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>1992</year>
					<article-title>Goodness-of-fit measures for probit and logit</article-title>
					<source>American Journal of Political Science</source>
					<volume>36</volume>
					<issue>3</issue>
					<fpage>762</fpage>
					<lpage>784</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2111590</pub-id>
				</element-citation>
				<mixed-citation>Hagle, T. &amp; Mitchell, G. (1992) Goodness-of-fit measures for probit and logit. 
					<italic>American Journal of Political Science</italic>, 36(3), pp. 762-784. DOI: 10.2307/2111590
				</mixed-citation>
			</ref>
			<ref id="B27">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Hair</surname>
							<given-names>J.</given-names>
						</name>
						<etal/>
					</person-group>
					<year>2009</year>
					<source>An&#x00E1;lise multivariada de dados</source>
					<publisher-loc>Porto Alegre</publisher-loc>
					<publisher-name>Bookman Editora</publisher-name>
				</element-citation>
				<mixed-citation>Hair, J. 
					<italic>et al.</italic> (2009) 
					<italic>An&#x00E1;lise multivariada de dados</italic>. Porto Alegre: Bookman Editora.
				</mixed-citation>
			</ref>
			<ref id="B28">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Henderson</surname>
							<given-names>E.</given-names>
						</name>
						<name>
							<surname>Singer</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<article-title>Civil war in the post-colonial world, 1946-92</article-title>
					<source>Journal of Peace Research</source>
					<volume>37</volume>
					<issue>3</issue>
					<fpage>275</fpage>
					<lpage>299</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1177/0022343300037003001</pub-id>
				</element-citation>
				<mixed-citation>Henderson, E. &amp; Singer, J. (2000) Civil war in the post-colonial world, 1946-92. 
					<italic>Journal of Peace Research</italic>, 37(3), pp. 275-299. DOI: 10.1177/0022343300037003001
				</mixed-citation>
			</ref>
			<ref id="B29">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Hilbe</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2009</year>
					<source>Logistic regression models</source>
					<publisher-loc>London</publisher-loc>
					<publisher-name>Chapman and Hall/CRC</publisher-name>
				</element-citation>
				<mixed-citation>Hilbe, J. (2009) 
					<italic>Logistic regression models</italic>. London: Chapman and Hall/CRC.
				</mixed-citation>
			</ref>
			<ref id="B30">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Hosmer</surname>
							<given-names>D.</given-names>
							<suffix>Jr</suffix>
						</name>
						<name>
							<surname>Lemeshow</surname>
							<given-names>S.</given-names>
						</name>
						<name>
							<surname>Sturdvanty</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<source>Applied logistic regression</source>
					<volume>398</volume>
					<publisher-loc>New York</publisher-loc>
					<publisher-name>John Wiley &amp; Sons</publisher-name>
				</element-citation>
				<mixed-citation>Hosmer Jr, D.; Lemeshow, S. &amp; Sturdvanty, R. (2013) 
					<italic>Applied logistic regression</italic> Vol. 398. New York: John Wiley &amp; Sons.
				</mixed-citation>
			</ref>
			<ref id="B31">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Hosmer</surname>
							<given-names>D.</given-names>
							<suffix>Jr</suffix>
						</name>
						<name>
							<surname>Lemeshow</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<source>Applied Logistic Regression</source>
					<publisher-loc>New York</publisher-loc>
					<publisher-name>John Wiley &amp; Sons</publisher-name>
				</element-citation>
				<mixed-citation>Hosmer Jr, D. &amp; Lemeshow, S. (2000) 
					<italic>Applied Logistic Regression</italic>. New York: John Wiley &amp; Sons.
				</mixed-citation>
			</ref>
			<ref id="B32">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Jaccard</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Jaccard</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2001</year>
					<source>Interaction effects in logistic regression</source>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
				</element-citation>
				<mixed-citation>Jaccard, J. &amp; Jaccard, J. (2001) 
					<italic>Interaction effects in logistic regression</italic>. Thousand Oaks: Sage Publications.
				</mixed-citation>
			</ref>
			<ref id="B33">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Janz</surname>
							<given-names>N.</given-names>
						</name>
					</person-group>
					<year>2016</year>
					<article-title>Bringing the gold standard into the classroom: replication in university teaching</article-title>
					<source>International Studies Perspectives</source>
					<volume>17</volume>
					<issue>4</issue>
					<fpage>392</fpage>
					<lpage>407</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1111/insp.12104</pub-id>
				</element-citation>
				<mixed-citation>Janz, N. (2016) Bringing the gold standard into the classroom: replication in university teaching. 
					<italic>International Studies Perspectives</italic>, 17(4), pp. 392-407. DOI: 10.1111/insp.12104
				</mixed-citation>
			</ref>
			<ref id="B34">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kay</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Little</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>1987</year>
					<article-title>Transformations of the explanatory variables in the logistic regression model for binary data</article-title>
					<source>Biometrika</source>
					<volume>74</volume>
					<issue>3</issue>
					<fpage>495</fpage>
					<lpage>501</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2336688</pub-id>
				</element-citation>
				<mixed-citation>Kay, R. &amp; Little, S. (1987) Transformations of the explanatory variables in the logistic regression model for binary data. 
					<italic>Biometrika</italic>, 74(3), pp. 495-501. DOI: 10.2307/2336688
				</mixed-citation>
			</ref>
			<ref id="B35">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Kennedy</surname>
							<given-names>P.</given-names>
						</name>
					</person-group>
					<year>2005</year>
					<source>A guide to econometrics</source>
					<publisher-loc>Oxford</publisher-loc>
					<publisher-name>Maldon</publisher-name>
				</element-citation>
				<mixed-citation>Kennedy, P. (2005) 
					<italic>A guide to econometrics</italic>, Oxford: Maldon.
				</mixed-citation>
			</ref>
			<ref id="B36">
				<element-citation publication-type="webpage">
					<person-group person-group-type="author">
						<name>
							<surname>Keprt</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Sn&#x00E1;sel</surname>
							<given-names>V.</given-names>
						</name>
					</person-group>
					<year>2004</year>
					<article-title>Binary Factor Analysis with Help of Formal Concepts</article-title>
					<comment>In: </comment>
					<source>The Second International Conference on Concept Lattices and Their Applications (CLA). Ostrava</source>
					<fpage>90</fpage>
					<lpage>101</lpage>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="http://ceur-ws.org/Vol-110/paper10.pdf">http://ceur-ws.org/Vol-110/paper10.pdf</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>13 out. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Keprt, A. &amp; Sn&#x00E1;sel, V. (2004) Binary Factor Analysis with Help of Formal Concepts. In: 
					<italic>The Second International Conference on Concept Lattices and Their Applications (CLA)</italic>. Ostrava, pp. 90-101. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="http://ceur-ws.org/Vol-110/paper10.pdf">http://ceur-ws.org/Vol-110/paper10.pdf</ext-link>. Acesso em: 13 out. 2020.
				</mixed-citation>
			</ref>
			<ref id="B37">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>King</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>1986</year>
					<article-title>How not to lie with statistics: Avoiding common mistakes in quantitative political science</article-title>
					<source>American Journal of Political Science</source>
					<volume>30</volume>
					<issue>3</issue>
					<fpage>666</fpage>
					<lpage>687</lpage>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://ssrn.com/ABSTRACT=084228">https://ssrn.com/ABSTRACT=084228</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>28 set. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>King, G. (1986) How not to lie with statistics: Avoiding common mistakes in quantitative political science. 
					<italic>American Journal of Political Science</italic>, 30(3), pp. 666-687. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://ssrn.com/ABSTRACT=084228">https://ssrn.com/ABSTRACT=084228</ext-link>. Acesso em: 28 set. 2020.
				</mixed-citation>
			</ref>
			<ref id="B38">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>King</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>1995</year>
					<article-title>Replication, replication</article-title>
					<source>PS: Political Science &amp; Politics</source>
					<volume>28</volume>
					<issue>3</issue>
					<fpage>444</fpage>
					<lpage>452</lpage>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://gking.harvard.edu/files/gking/files/replication.pdf">https://gking.harvard.edu/files/gking/files/replication.pdf</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>13 out. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>King, G. (1995) Replication, replication. 
					<italic>PS: Political Science &amp; Politics</italic>, 28(3), pp. 444-452. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://gking.harvard.edu/files/gking/files/replication.pdf">https://gking.harvard.edu/files/gking/files/replication.pdf</ext-link>. Acesso em: 13 out. 2020.
				</mixed-citation>
			</ref>
			<ref id="B39">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>King</surname>
							<given-names>G.</given-names>
						</name>
						<name>
							<surname>Zeng</surname>
							<given-names>L.</given-names>
						</name>
					</person-group>
					<year>2001</year>
					<article-title>Logistic regression in rare events data</article-title>
					<source>Political analysis</source>
					<volume>9</volume>
					<issue>2</issue>
					<fpage>137</fpage>
					<lpage>163</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1093/oxfordjournals.pan.a004868</pub-id>
				</element-citation>
				<mixed-citation>King, G. &amp; Zeng, L. (2001) Logistic regression in rare events data. 
					<italic>Political analysis</italic>, 9(2), pp. 137-163. DOI: 10.1093/oxfordjournals.pan.a004868
				</mixed-citation>
			</ref>
			<ref id="B40">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Kleinbaum</surname>
							<given-names>D.</given-names>
						</name>
						<name>
							<surname>Klein</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>2010</year>
					<source>Logistic regression: A Self-Learning Text</source>
					<publisher-loc>New York</publisher-loc>
					<publisher-name>Springer-Verlag</publisher-name>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1007/978-1-4419-1742-3</pub-id>
				</element-citation>
				<mixed-citation>Kleinbaum, D. &amp; Klein, M. (2010) 
					<italic>Logistic regression: A Self-Learning Text</italic>. New York: Springer-Verlag. DOI: 10.1007/978-1-4419-1742-3
				</mixed-citation>
			</ref>
			<ref id="B41">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Krueger</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Lewis-Beck</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>2008</year>
					<article-title>Is ols dead?</article-title>
					<source>The Political Methodologist</source>
					<volume>15</volume>
					<issue>2</issue>
					<fpage>2</fpage>
					<lpage>4</lpage>
					<comment>2008</comment>
				</element-citation>
				<mixed-citation>Krueger, J. &amp; Lewis-Beck, M. (2008) Is ols dead? 
					<italic>The Political Methodologist</italic>, 15(2), pp. 2-4, 2008.
				</mixed-citation>
			</ref>
			<ref id="B42">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Landwehr</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Pregibon</surname>
							<given-names>D.</given-names>
						</name>
						<name>
							<surname>Shoemaker</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<year>1984</year>
					<article-title>Graphical methods for assessing logistic regression models</article-title>
					<source>Journal of the American Statistical Association</source>
					<volume>79</volume>
					<issue>385</issue>
					<fpage>61</fpage>
					<lpage>71</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1080/01621459.1984.10477062</pub-id>
				</element-citation>
				<mixed-citation>Landwehr, J.; Pregibon, D. &amp; Shoemaker, A. (1984) Graphical methods for assessing logistic regression models. 
					<italic>Journal of the American Statistical Association</italic>, 79(385), pp. 61-71. DOI: 10.1080/01621459.1984.10477062
				</mixed-citation>
			</ref>
			<ref id="B43">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Lewis-Beck</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>1980</year>
					<source>Applied Regression</source>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
				</element-citation>
				<mixed-citation>Lewis-Beck, M. (1980) 
					<italic>Applied Regression.</italic> Thousand Oaks: Sage Publications.
				</mixed-citation>
			</ref>
			<ref id="B44">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Long</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>1997</year>
					<article-title>Regression models for categorical and limited dependent variables</article-title>
					<source>Advanced quantitative techniques in the social sciences</source>
					<volume>7</volume>
					<issue>s/n</issue>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
				</element-citation>
				<mixed-citation>Long, J. (1997) Regression models for categorical and limited dependent variables. 
					<italic>Advanced quantitative techniques in the social sciences</italic>, 7(s/n), Thousand Oaks: Sage Publications.
				</mixed-citation>
			</ref>
			<ref id="B45">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Lottes</surname>
							<given-names>I.</given-names>
						</name>
						<name>
							<surname>DeMaris</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Adler</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>1996</year>
					<article-title>Using and interpreting logistic regression: A guide for teachers and students</article-title>
					<source>Teaching Sociology</source>
					<volume>24</volume>
					<issue>3</issue>
					<fpage>284</fpage>
					<lpage>298</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/1318743</pub-id>
				</element-citation>
				<mixed-citation>Lottes, I.; DeMaris, A. &amp; Adler, M. (1996) Using and interpreting logistic regression: A guide for teachers and students. 
					<italic>Teaching Sociology</italic>, 24(3), pp. 284-298. DOI: 10.2307/1318743
				</mixed-citation>
			</ref>
			<ref id="B46">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Menard</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<article-title>Coefficients of determination for multiple logistic regression analysis</article-title>
					<source>The American Statistician</source>
					<volume>54</volume>
					<issue>1</issue>
					<fpage>17</fpage>
					<lpage>24</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1080/00031305.2000.10474502</pub-id>
				</element-citation>
				<mixed-citation>Menard, S. (2000) Coefficients of determination for multiple logistic regression analysis. 
					<italic>The American Statistician</italic>, 
					<italic>54</italic>(1), pp. 17-24. DOI: 10.1080/00031305.2000.10474502
				</mixed-citation>
			</ref>
			<ref id="B47">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Menard</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>2002</year>
					<source>Applied logistic regression analysis</source>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.4135/9781412983433</pub-id>
				</element-citation>
				<mixed-citation>Menard, S. (2002). 
					<italic>Applied logistic regression analysis</italic>. Thousand Oaks: Sage Publications. DOI: 10.4135/9781412983433
				</mixed-citation>
			</ref>
			<ref id="B48">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Menard</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>2004</year>
					<article-title>Six approaches to calculating standardized logistic regression coefficients</article-title>
					<source>The American Statistician</source>
					<volume>58</volume>
					<issue>3</issue>
					<fpage>218</fpage>
					<lpage>223</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1198/000313004X946</pub-id>
				</element-citation>
				<mixed-citation>Menard, S. (2004) Six approaches to calculating standardized logistic regression coefficients. 
					<italic>The American Statistician</italic>, 
					<italic>58</italic>(3), pp. 218-223. DOI: 10.1198/000313004X946
				</mixed-citation>
			</ref>
			<ref id="B49">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Nelder</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Wedderburn</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>1972</year>
					<article-title>Generalized linear models</article-title>
					<source>Journal of the Royal Statistical Society: Series A (General)</source>
					<volume>135</volume>
					<issue>3</issue>
					<fpage>370</fpage>
					<lpage>384</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2344614</pub-id>
				</element-citation>
				<mixed-citation>Nelder, J. &amp; Wedderburn, R. (1972) Generalized linear models. 
					<italic>Journal of the Royal Statistical Society: Series A (General)</italic>, 135(3), pp. 370-384. DOI: 10.2307/2344614
				</mixed-citation>
			</ref>
			<ref id="B50">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Nicolau</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<article-title>An analysis of the 2002 presidential elections using logistic regression</article-title>
					<source>Brazilian Political Science Review</source>
					<volume>1</volume>
					<issue>1</issue>
					<fpage>125</fpage>
					<lpage>135</lpage>
				</element-citation>
				<mixed-citation>Nicolau, J. (2000) An analysis of the 2002 presidential elections using logistic regression. 
					<italic>Brazilian Political Science Review</italic>, 1(1), pp. 125-135.
				</mixed-citation>
			</ref>
			<ref id="B51">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>O&#x0027;Brien</surname>
							<given-names>S. M</given-names>
						</name>
						<name>
							<surname>Dunson</surname>
							<given-names>D.</given-names>
						</name>
					</person-group>
					<year>2004</year>
					<article-title>Bayesian multivariate logistic regression</article-title>
					<source>Biometrics</source>
					<volume>60</volume>
					<issue>3</issue>
					<fpage>739</fpage>
					<lpage>746</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1111/j.0006-341X.2004.00224.x</pub-id>
				</element-citation>
				<mixed-citation>O&#x0027;Brien, S. M &amp; Dunson, D. (2004) Bayesian multivariate logistic regression. 
					<italic>Biometrics</italic>, 60(3), pp. 739-746. DOI: 10.1111/j.0006-341X.2004.00224.x
				</mixed-citation>
			</ref>
			<ref id="B52">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>O&#x0027;Connell</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<year>2006</year>
					<source>Logistic regression models for ordinal response variables</source>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.4135/9781412984812</pub-id>
				</element-citation>
				<mixed-citation>O&#x0027;Connell, A. (2006) 
					<italic>Logistic regression models for ordinal response variables</italic>. Thousand Oaks: Sage Publications. DOI: 10.4135/9781412984812
				</mixed-citation>
			</ref>
			<ref id="B53">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Pampel</surname>
							<given-names>F.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<source>Logistic regression: A primer</source>
					<publisher-loc>Thousand Oaks</publisher-loc>
					<publisher-name>Sage Publications</publisher-name>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.4135/9781412984805</pub-id>
				</element-citation>
				<mixed-citation>Pampel, F. (2000) 
					<italic>Logistic regression: A primer.</italic> Thousand Oaks: Sage Publications. DOI: 10.4135/9781412984805
				</mixed-citation>
			</ref>
			<ref id="B54">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Paranhos</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Figueiredo</surname>
							<given-names>D.</given-names>
							<suffix>Filho</suffix>
						</name>
						<name>
							<surname>Rocha</surname>
							<given-names>E.</given-names>
						</name>
						<name>
							<surname>Carmo</surname>
							<given-names>E.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<article-title>A import&#x00E2;ncia da replicabilidade na ci&#x00EA;ncia pol&#x00ED;tica: o caso do SIGOBR</article-title>
					<source>Revista Pol&#x00ED;tica Hoje</source>
					<volume>22</volume>
					<issue>2</issue>
					<fpage>213</fpage>
					<lpage>229</lpage>
				</element-citation>
				<mixed-citation>Paranhos, R.; Figueiredo Filho, D.; Rocha, E. &amp; Carmo, E. (2013) A import&#x00E2;ncia da replicabilidade na ci&#x00EA;ncia pol&#x00ED;tica: o caso do SIGOBR. 
					<italic>Revista Pol&#x00ED;tica Hoje</italic>, 
					<italic>22</italic>(2), pp. 213-229.
				</mixed-citation>
			</ref>
			<ref id="B55">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Pardoe</surname>
							<given-names>I.</given-names>
						</name>
						<name>
							<surname>Cook</surname>
							<given-names>R.</given-names>
						</name>
					</person-group>
					<year>2002</year>
					<article-title>A graphical method for assessing the fit of a logistic regression model</article-title>
					<source>The American Statistician</source>
					<volume>56</volume>
					<issue>4</issue>
					<fpage>263</fpage>
					<lpage>272</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1198/000313002560</pub-id>
				</element-citation>
				<mixed-citation>Pardoe, I. &amp; Cook, R. (2002) A graphical method for assessing the fit of a logistic regression model. 
					<italic>The American Statistician</italic>, 56(4), pp. 263-272. DOI: 10.1198/000313002560
				</mixed-citation>
			</ref>
			<ref id="B56">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Pedhazur</surname>
							<given-names>E.</given-names>
						</name>
					</person-group>
					<year>1982</year>
					<source>Multiple Regression in Behavioral Research</source>
					<publisher-loc>New York</publisher-loc>
					<publisher-name>Holt, Rinehart and Winston</publisher-name>
				</element-citation>
				<mixed-citation>Pedhazur, E. (1982) 
					<italic>Multiple Regression in Behavioral Research</italic>. New York: Holt, Rinehart and Winston.
				</mixed-citation>
			</ref>
			<ref id="B57">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Peixoto</surname>
							<given-names>V.</given-names>
						</name>
					</person-group>
					<year>2009</year>
					<article-title>Financiamento de campanhas: o Brasil em perspectiva comparada</article-title>
					<source>Perspectivas: revista de ci&#x00EA;ncias sociais</source>
					<volume>35</volume>
					<issue>s/n</issue>
					<fpage>91</fpage>
					<lpage>116</lpage>
				</element-citation>
				<mixed-citation>Peixoto, V. (2009) Financiamento de campanhas: o Brasil em perspectiva comparada. 
					<italic>Perspectivas: revista de ci&#x00EA;ncias sociais</italic>, 35(s/n), pp. 91-116.
				</mixed-citation>
			</ref>
			<ref id="B58">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Press</surname>
							<given-names>S.</given-names>
						</name>
						<name>
							<surname>Wilson</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>1997</year>
					<article-title>Choosing between logistic regression and discriminant analysis</article-title>
					<source>Journal of the American Statistical Association</source>
					<volume>73</volume>
					<issue>364</issue>
					<fpage>699</fpage>
					<lpage>705</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1080/01621459.1978.10480080</pub-id>
				</element-citation>
				<mixed-citation>Press, S. &amp; Wilson, S. (197) Choosing between logistic regression and discriminant analysis. 
					<italic>Journal of the American Statistical Association</italic>, 73(364), pp. 699-705. DOI: 10.1080/01621459.1978.10480080
				</mixed-citation>
			</ref>
			<ref id="B59">
				<element-citation publication-type="webpage">
					<person-group person-group-type="author">
						<name>
							<surname>Revelle</surname>
							<given-names>W.</given-names>
						</name>
					</person-group>
					<year>2018</year>
					<article-title>Psych: Procedures for Personality and Psychological Research</article-title>
					<source>Northwestern University, Evanston, Illinois, USA</source>
					<comment>Dispon&#x00ED;vel em: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/psych/">https://cran.r-project.org/web/packages/psych/</ext-link>
					<comment>Acesso em: </comment>
					<date-in-citation>28 set. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Revelle, W. (2018) 
					<italic>Psych: Procedures for Personality and Psychological Research</italic>. Northwestern University, Evanston, Illinois, USA. Dispon&#x00ED;vel em: 
					<ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/psych/">https://cran.r-project.org/web/packages/psych/</ext-link>. Acesso em: 28 set. 2020.
				</mixed-citation>
			</ref>
			<ref id="B60">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Ribeiro</surname>
							<given-names>E.</given-names>
						</name>
						<name>
							<surname>Carreir&#x00E3;o</surname>
							<given-names>Y.</given-names>
						</name>
						<name>
							<surname>Borba</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2011</year>
					<article-title>Sentimentos partid&#x00E1;rios e atitudes pol&#x00ED;ticas entre os brasileiros</article-title>
					<source>Opini&#x00E3;o P&#x00FA;blica</source>
					<volume>17</volume>
					<issue>2</issue>
					<fpage>333</fpage>
					<lpage>368</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1590/S0104-62762011000200003</pub-id>
				</element-citation>
				<mixed-citation>Ribeiro, E.; Carreir&#x00E3;o, Y. &amp; Borba, J. (2011) Sentimentos partid&#x00E1;rios e atitudes pol&#x00ED;ticas entre os brasileiros. 
					<italic>Opini&#x00E3;o P&#x00FA;blica</italic>, 17(2), pp. 333-368. DOI: 10.1590/S0104-62762011000200003
				</mixed-citation>
			</ref>
			<ref id="B61">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Roberts</surname>
							<given-names>G.</given-names>
						</name>
						<name>
							<surname>Rao</surname>
							<given-names>N.</given-names>
						</name>
						<name>
							<surname>Kumar</surname>
							<given-names>S.</given-names>
						</name>
					</person-group>
					<year>1987</year>
					<article-title>Logistic regression analysis of sample survey data</article-title>
					<source>Biometrika</source>
					<volume>74</volume>
					<issue>1</issue>
					<fpage>1</fpage>
					<lpage>12</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.2307/2336016</pub-id>
				</element-citation>
				<mixed-citation>Roberts, G.; Rao, N. &amp; Kumar, S. (1987) Logistic regression analysis of sample survey data. 
					<italic>Biometrika</italic>, 74(1), pp. 1-12. DOI: 10.2307/2336016
				</mixed-citation>
			</ref>
			<ref id="B62">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Schwab</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2002</year>
					<source>Multinomial logistic regression: Basic relationships and complete problems</source>
					<publisher-loc>Austin, Texas</publisher-loc>
					<publisher-name>University of Texas</publisher-name>
				</element-citation>
				<mixed-citation>Schwab, J. (2002) 
					<italic>Multinomial logistic regression: Basic relationships and complete problems</italic>. Austin, Texas: University of Texas.
				</mixed-citation>
			</ref>
			<ref id="B63">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Soares</surname>
							<given-names>G.</given-names>
						</name>
					</person-group>
					<year>2000</year>
					<article-title>Em busca da racionalidade perdida: alguns determinantes do voto no Distrito Federal</article-title>
					<source>Revista Brasileira de Ci&#x00EA;ncias Sociais</source>
					<volume>15</volume>
					<issue>43</issue>
					<fpage>5</fpage>
					<lpage>23</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1590/S0102-69092000000200001</pub-id>
				</element-citation>
				<mixed-citation>Soares, G. (2000) Em busca da racionalidade perdida: alguns determinantes do voto no Distrito Federal. 
					<italic>Revista Brasileira de Ci&#x00EA;ncias Sociais</italic>, 15(43), pp. 5-23. DOI: 10.1590/S0102-69092000000200001
				</mixed-citation>
			</ref>
			<ref id="B64">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Speck</surname>
							<given-names>B.</given-names>
						</name>
						<name>
							<surname>Mancuso</surname>
							<given-names>W.</given-names>
						</name>
					</person-group>
					<year>2013</year>
					<article-title>O que faz a diferen&#x00E7;a? Gastos de campanha, capital pol&#x00ED;tico, sexo e contexto municipal nas elei&#x00E7;&#x00F5;es para prefeito em 2012</article-title>
					<source>Cadernos Adenauer</source>
					<volume>14</volume>
					<issue>2</issue>
					<fpage>109</fpage>
					<lpage>126</lpage>
				</element-citation>
				<mixed-citation>Speck, B. &amp; Mancuso, W. (2013) O que faz a diferen&#x00E7;a? Gastos de campanha, capital pol&#x00ED;tico, sexo e contexto municipal nas elei&#x00E7;&#x00F5;es para prefeito em 2012. 
					<italic>Cadernos Adenauer</italic>, 14(2), pp. 109-126.
				</mixed-citation>
			</ref>
			<ref id="B65">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Stock</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Watson</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>2015</year>
					<source>Introduction to Econometrics</source>
					<edition>3&#x00AA; Edition</edition>
					<publisher-loc>United Kingdom</publisher-loc>
					<publisher-name>Pearson</publisher-name>
				</element-citation>
				<mixed-citation>Stock, J. &amp; Watson, M. (2015) 
					<italic>Introduction to Econometrics</italic>. 3&#x00AA; Edition. United Kingdom: Pearson.
				</mixed-citation>
			</ref>
			<ref id="B66">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Tabachnick</surname>
							<given-names>B.</given-names>
						</name>
						<name>
							<surname>Fidell</surname>
							<given-names>L.</given-names>
						</name>
						<name>
							<surname>Ullman</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<year>2007</year>
					<source>Using multivariate statistics</source>
					<publisher-loc>Boston, MA</publisher-loc>
					<publisher-name>Pearson</publisher-name>
				</element-citation>
				<mixed-citation>Tabachnick, B.; Fidell, L. &amp; Ullman, J. (2007) 
					<italic>Using multivariate statistics</italic>. Boston, MA: Pearson.
				</mixed-citation>
			</ref>
			<ref id="B67">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Taylor</surname>
							<given-names>J.</given-names>
						</name>
						<name>
							<surname>Yu</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<year>2002</year>
					<article-title>Bias and Efficiency Loss Due to Categorizing an Explanatory Variable</article-title>
					<source>Journal of Multivariate Analysis</source>
					<volume>83</volume>
					<issue>s/n</issue>
					<fpage>248</fpage>
					<lpage>263</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1006/jmva.2001.2045</pub-id>
				</element-citation>
				<mixed-citation>Taylor, J. &amp; Yu, M. (2002) Bias and Efficiency Loss Due to Categorizing an Explanatory Variable. 
					<italic>Journal of Multivariate Analysis</italic>, 83(s/n), pp. 248&#x2013;263. DOI: 10.1006/jmva.2001.2045
				</mixed-citation>
			</ref>
			<ref id="B68">
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wong</surname>
							<given-names>G.</given-names>
						</name>
						<name>
							<surname>Mason</surname>
							<given-names>W.</given-names>
						</name>
					</person-group>
					<year>1985</year>
					<article-title>The hierarchical logistic regression model for multilevel analysis</article-title>
					<source>Journal of the American Statistical Association</source>
					<volume>80</volume>
					<issue>391</issue>
					<fpage>513</fpage>
					<lpage>524</lpage>
					<comment>DOI: </comment>
					<pub-id pub-id-type="doi">10.1080/01621459.1985.10478148</pub-id>
				</element-citation>
				<mixed-citation>Wong, G. &amp; Mason, W. (1985) The hierarchical logistic regression model for multilevel analysis. 
					<italic>Journal of the American Statistical Association</italic>, 80(391), pp. 513-524. DOI: 10.1080/01621459.1985.10478148
				</mixed-citation>
			</ref>
		</ref-list>
		<ref-list>
			<title>Newspaper articles</title>
			<ref id="B69">
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<collab>O julgamento do Mensal&#x00E3;o</collab>
					</person-group>
					<year>2012</year>
					<source>Folha de S&#x00E3;o Paulo</source>
					<publisher-loc>S&#x00E3;o Paulo</publisher-loc>
					<comment>12. Jun. Available through: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://www1.folha.uol.com.br/especial/2012/ojulgamentodomensalao/">https://www1.folha.uol.com.br/especial/2012/ojulgamentodomensalao/</ext-link>
					<comment>Accessed at: </comment>
					<date-in-citation>10 nov. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>O julgamento do Mensal&#x00E3;o (2012) 
					<italic>Folha de S&#x00E3;o Paulo</italic>. S&#x00E3;o Paulo. 12. Jun. Available through: &lt;
					<ext-link ext-link-type="uri" xlink:href="https://www1.folha.uol.com.br/especial/2012/ojulgamentodomensalao/">https://www1.folha.uol.com.br/especial/2012/ojulgamentodomensalao/</ext-link>&gt;. Accessed at: 10 nov. 2020.
				</mixed-citation>
			</ref>
			<ref id="B70">
				<element-citation publication-type="webpage">
					<person-group person-group-type="author">
						<collab>Entenda o Esc&#x00E2;ndalo dos sanguessugas</collab>
					</person-group>
					<year>2006</year>
					<source>Estado de S&#x00E3;o Paulo. S&#x00E3;o Paulo</source>
					<comment>11.dec. Available through: </comment>
					<ext-link ext-link-type="uri" xlink:href="https://politica.estadao.com.br/noticias/geral,entenda-o-escandalo-dos-sanguessugas,20061211p60113">https://politica.estadao.com.br/noticias/geral,entenda-o-escandalo-dos-sanguessugas,20061211p60113</ext-link>
					<comment>Accessed at: </comment>
					<date-in-citation>10 nov. 2020</date-in-citation>
				</element-citation>
				<mixed-citation>Entenda o Esc&#x00E2;ndalo dos sanguessugas (2006) Estado de S&#x00E3;o Paulo. S&#x00E3;o Paulo. 11.dec. Available through: &lt;
					<ext-link ext-link-type="uri" xlink:href="https://politica.estadao.com.br/noticias/geral,entenda-o-escandalo-dos-sanguessugas,20061211p60113">https://politica.estadao.com.br/noticias/geral,entenda-o-escandalo-dos-sanguessugas,20061211p60113</ext-link>&gt;. Accessed at: 10 nov. 2020.
				</mixed-citation>
			</ref>
		</ref-list>
		<fn-group>
			<fn fn-type="other" id="fn1">
				<label>1</label>
				<p>Replication materials available at: &lt;https://osf.io/nv4ae/&gt;. This paper benefitted from the comments of professor Jairo Nicolau and the suggestions made by 
					<italic>Revista de Sociologia e Pol&#x00ED;tica</italic>&#x2018;s anonymous reviewers. We also thank the 
					<italic>Berkeley Initiative for Transparency in the Social Sciences</italic> and the 
					<italic>Teaching Integrity in Empirical Research</italic>.
				</p>
			</fn>
			<fn fn-type="other" id="fn2">
				<label>2</label>
				<p>For a brief review of Mensal&#x00E3;o, see O julgamento do Mensal&#x00E3;o (2012).</p>
			</fn>
			<fn fn-type="other" id="fn3">
				<label>3</label>
				<p>For a explanation of the Sanguessugas scandal, see Entenda o Esc&#x00E2;ndalo dos sanguessugas (2006).</p>
			</fn>
			<fn fn-type="other" id="fn4">
				<label>4</label>
				<p>See the course on logistic regression offered by 
					<italic>Coursera</italic> (
					<ext-link ext-link-type="uri" xlink:href="https://www.coursera.org/course/logisticregression">https://www.coursera.org/course/logisticregression</ext-link>). We also suggest the categorical data analysis course given by the Intensive Training on Quantitative Methodology, from the Federal University of Minas Gerais (MQ &#x2013; UFMG).
				</p>
			</fn>
			<fn fn-type="other" id="fn5">
				<label>5</label>
				<p>We will not discuss the mathematical foundations of logistic regression. For readers interested in the topic, we suggest Long (1977) and Pampel (2000).</p>
			</fn>
			<fn fn-type="other" id="fn6">
				<label>6</label>
				<p>There are extensions of the logistic model that enable modelling the variation of ordinal (ordinal logistic regression) and polychotomous variables (multinomial logistic regression).</p>
			</fn>
			<fn fn-type="other" id="fn7">
				<label>7</label>
				<p>Nelder and Wedderburn (1972) demonstrated that it is possible to use the same algorithm to estimate models of the exponential family, such as Logistic, Probit, Poisson, Gama, and Inverse Normal. Do not worry about the formulas for these models. The important thing is to understand what each of them are for, when they should be used, and how the coefficients must be interpreted.</p>
			</fn>
			<fn fn-type="other" id="fn8">
				<label>8</label>
				<p>Hair 
					<italic>et al.</italic> (2009) state that homoscedasticity is the assumption that the dependent variable displays equal levels of variance over a range of the predictor variable (Hair 
					<italic>et al.</italic>, 2009, p. 83). 2013, p. 77
				</p>
			</fn>
			<fn fn-type="other" id="fn9">
				<label>9</label>
				<p>For Hair 
					<italic>et al.</italic> (2009), an implied assumption for all multivariate analysis techniques based on correlational measures of association, including multiple linear regression and logistic regression, is linearity (Hair 
					<italic>et al.</italic>, 2009, p. 85).
				</p>
			</fn>
			<fn fn-type="other" id="fn10">
				<label>10</label>
				<p>One estimator is the 
					<italic>Best Linear Unbiased Estimator</italic>, when the following properties are satisfied. Best means efficient, producing the least variance, linear means the type of relationship expected between parameters, and unbiased concerns the sampling distribution of the estimator. A biased estimator is one that systematically over- or underestimates the value of the population parameter.
				</p>
			</fn>
			<fn fn-type="other" id="fn11">
				<label>11</label>
				<p>The data are available at: &lt;http://www.ats.ucla.edu/stat/stata/examples/alr2/alr2stata1.htm&gt;.</p>
			</fn>
			<fn fn-type="other" id="fn12">
				<label>12</label>
				<p>A logistic regression also supports variables with more than two categories. When there is no hierarchy between the category, such as with the distribution of civil status, we should use a multinomial regression. On the other hand, an ordinal logistic regression is ideal to model the distribution of ordinal variables, that is, when there is a structure of intensity between the categories.</p>
			</fn>
			<fn fn-type="other" id="fn13">
				<label>13</label>
				<p>Categorizing variables tends to produce biased and inefficient estimates (Taylor &amp; Yu, 2002). Given this, we emphasize the term &#x201C;originally dichotomous&#x201D;, and recommend never reducing the level of measurement for continuous, discrete, or ordinal variables with the aim of applying logistic regression models. Still in doubt? Check Fernandes 
					<italic>et al.</italic> (2019).
				</p>
			</fn>
			<fn fn-type="other" id="fn14">
				<label>14</label>
				<p>When the correlation is very high (some use the golden rule of r &#x2265; 0,90), the coefficients&#x2019; standard error is large, hindering the evaluation of the relative importance of the explanatory variables. To better understand the problems that high levels of correlation among independent variables may generate, see Figueiredo, Silva, and Domingos (2015).</p>
			</fn>
			<fn fn-type="other" id="fn15">
				<label>15</label>
				<p>For an introduction on how to detect outliers, see Figueiredo Filho and Silva (2016), available at: &lt;https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf&gt;.</p>
			</fn>
			<fn fn-type="other" id="fn16">
				<label>16</label>
				<p>A researcher may provide the data at Harvard University&#x2019;s 
					<italic>Dataverse</italic>. The 
					<italic>Open Science Framework</italic> may also be used to make available data for broader projects. In Brazil, we suggest the Social Information Consortium (CIS).
				</p>
			</fn>
			<fn fn-type="other" id="fn17">
				<label>17</label>
				<p>In a linear model, the regression coefficient is represented as the variation observed on the dependent variable (Y) when the independent variable (X) increases in one unit. In a logistic regression, the coefficient indicates the variation in the logarithm of the chance for the dependent variable by increasing the explanatory variable in one unit.</p>
			</fn>
			<fn fn-type="other" id="fn18">
				<label>18</label>
				<p>Readers unfamiliar with the concept of chance should consult the Methodological 
					<xref ref-type="app" rid="app1">Appendix</xref> of this article before reading further. For a more detailed treatment, see 
					<xref ref-type="bibr" rid="B29">Hilbe (2009)</xref>.
				</p>
			</fn>
			<fn fn-type="other" id="fn19">
				<label>19</label>
				<p>When interpreting the statistical significance of the confidence interval of the odds regression coefficient, we must observe if the interval includes the value one (1). If so, we are faced with a non-significant result. For example, in a confidence interval in which the coefficient varies between 0,8 and 1,6, it is not possible to reject the null hypothesis.</p>
			</fn>
			<fn fn-type="other" id="fn20">
				<label>20</label>
				<p>Following best scientific practices, the authors made the data and scripts available at the following website: &lt;http://thedata.harvard.edu/dvn/dv/felipenunes&gt;.</p>
			</fn>
			<fn fn-type="other" id="fn21">
				<label>21</label>
				<p>The main advantage of using 0/1 coding is that the distribution&#x2019;s mean will be equal to the proportion of 1 cases in the sample. In a distribution with 100 occurrences, in which 25 cases have been coded as 1, the mean will be 0.25, which represents exactly the proportion of events coded as 1.</p>
			</fn>
			<fn fn-type="other" id="fn22">
				<label>22</label>
				<p>Castro and Nunes (2014) estimated the regression model from a probit link function. The logit function is better suited for small samples (n &lt; 20) given that it presents a higher convergence rate. For large samples, on the other hand, there are no significant differences among these link functions. For more information on the topic, see Freitas (2013).</p>
			</fn>
			<fn fn-type="other" id="fn23">
				<label>23</label>
				<p>The researcher must make sure that no category has a distribution smaller than 5%. This is due to the phenomenon being then categorized as rare, and specific corrections to deal with this situation are needed. For interested readers, see King and Zeng (2001).</p>
			</fn>
			<fn fn-type="other" id="fn24">
				<label>24</label>
				<p>These finds diverge residually from the information reported in Tables 4 and 5 by Castro and Nunes (2014), which indicate 9 reelections out of a total of 50 representatives, equaling 18%.</p>
			</fn>
			<fn fn-type="other" id="fn25">
				<label>25</label>
				<p>And this can be calculated from the odds ratio, which is calculated by the dividing the chances of reelection for each group, in this case, 1.9/0.22. That is, candidates not involved in corruption scandals have an 8 times higher chance of being reelected when compared to the deputies named in the Mensal&#x00E3;o and/or Sanguessugas schemes, as measured by Castro and Nunes (2014).</p>
			</fn>
			<fn fn-type="other" id="fn26">
				<label>26</label>
				<p>For Garson (2011), the omnibus test can be interpreted as a test for the joint capacity of all the predictors in the model to predict the response (dependent) variable. A significant result indicates that the fit is adequate to the data, suggesting that at least one of the predictors is significantly related to the response variable.</p>
			</fn>
			<fn fn-type="other" id="fn27">
				<label>27</label>
				<p>In the original, &#x201C;the successful allocation of pork does not present, subverting expectations, positive association with reelection. The result seems to be null and irrelevant to explain the chances of reelection in 2006, also when socioeconomic and institutional variables are included in the model&#x201D;. (Castro &amp; Nunes, 2014, p. 42).</p>
			</fn>
			<fn fn-type="other" id="fn28">
				<label>28</label>
				<p>The maximum likelihood method is an iterative process that aims to fit the model through several repetitions. However, sometimes the model simply does not converge. This can happen for several reasons, from problems in the algorithms uses to estimate the link function to a strongly asymmetrical distribution of the independent variables.</p>
			</fn>
			<fn fn-type="other" id="fn29">
				<label>29</label>
				<p>There is a debate on the advantages and limitations of R
					<sup>2</sup> as a synthetic measure to evaluate the quality of fit of logistic regression models. To our knowledge, King (1986) is the first systematic alert on the issue in empirical research in Political Science. Figueiredo Filho, Silva J&#x00FA;nior, and Rocha (2012) have a pedagogical discussion on the topic.
				</p>
			</fn>
			<fn fn-type="other" id="fn30">
				<label>30</label>
				<p>Hair 
					<italic>et al.</italic> (2009) state that a logistic model&#x2019;s fit can be evaluated by two main procedures: (1) pseudo R
					<sup>2</sup>s, similarly to a linear regression and (2) by estimating the predictive capacity of the model.
				</p>
			</fn>
			<fn fn-type="other" id="fn31">
				<label>31</label>
				<p>There are also McFadden's pseudo R
					<sup>2</sup>, McKelvey and Savoina pseudo R
					<sup>2</sup>, McFadden pseudo R
					<sup>2</sup>, Cragg and Uhler pseudo R
					<sup>2</sup> and Efron pseudo R
					<sup>2</sup>. For the reader interested in deepening their knowledge on the subject, see Hagle and Mitchell (1992) and 
					<xref ref-type="bibr" rid="B46">Menard (2000)</xref>.
				</p>
			</fn>
			<fn fn-type="other" id="fn32">
				<label>32</label>
				<p>This section was based on Schwab (2002).</p>
			</fn>
			<fn fn-type="other" id="fn33">
				<p>A produ&#x00E7;&#x00E3;o desse manuscrito foi viabilizada atrav&#x00E9;s do patroc&#x00ED;nio fornecido pelo Centro Universit&#x00E1;rio Internacional Uninter &#x00E0; 
					<italic>Revista de Sociologia e Pol&#x00ED;tica</italic>.
				</p>
			</fn>
		</fn-group>
		<app-group>
			<app id="app1">
				<title>Appendix</title>
				<p>In this section, we present some information that can help researchers to interpret logistic regression coefficients. In particular, we examine the interpretation of the odds ratio. In addition, we list some learning tools.</p>
				<p>&#x2022; Understanding the odds ratio
					<xref ref-type="fn" rid="fn32">
						<sup>32</sup>
					</xref>
				</p>
				<p>The term odds ratio is not as disseminated in Political Science applied research as are mean or probability. Usually, since the researcher is comparing groups/categories, they are interested in analyzing which group/category has a better chance of occurring in relation to another group/category. Consider the following example: suppose that the probability (p) of a certain event occurring is 0,9. Thus, when calculating the complementary event, q = 1 &#x2013; p, we have 1 &#x2013; 0,9 = 0,1. Chance is the division of the probability of occurrence (p) by the probability of non-occurrence (q). Consequently, 0,9/0,1 = 9. It is stated, then, that the chance for success is 9 to 1. Alternatively, the chance for failure is 0,1/0,9 = 0,11. We say then that the chance for failure is 1 to 9. Unlike probability, which can only take on values between 0 and 1, chance can vary between 0 and infinity. When the probability of an event occurring is greater than the probability of it not occurring, its chance will be greater than 1. When the probability of it not occurring is greater, chance will be smaller than 1. When probabilities are equal (e.g., tossing a coin), chance is equal to 1. Given the pedagogical purposes of this paper, it is relevant to replicate the data from Schawb (2002), to better grasp this concept (
					<xref ref-type="table" rid="t1a">Table 1A</xref>).
				</p>
				<table-wrap id="t1a">
					<label>Table 1A</label>
					<caption>
						<title>Frequency</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="33%">
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Sentence</th>
								<th align="center" valign="top">N</th>
								<th align="center" valign="top">%</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">Death penalty</td>
								<td align="center" valign="top">50</td>
								<td align="center" valign="top">34</td>
							</tr>
							<tr>
								<td align="left" valign="top">Life in prison</td>
								<td align="center" valign="top">97</td>
								<td align="center" valign="top">66</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">
									<bold>147</bold>
								</td>
								<td align="center" valign="top">
									<bold>100.0</bold>
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Source: Schwab (2002).</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>
					<xref ref-type="table" rid="t1a">Table 1A</xref> shows that 34% of inmates were sentenced to the death penalty (n&nbsp;= 50/147). This means that the probability of this event occurring is 0f 0,34. Alternatively, the chance of being given capital punishment is 0,516 (50/97). Another way of saying this is that the chances are approximately half of being sentenced to capital punishment in relation to spending life in prison. Lastly, it is possible to invert the interpretation and consider life in prison roughly two times more likely than the death penalty.
				</p>
				<p>So far, there are no independent variables. What the logistic model will inform is the impact of a given variable on the chance of a dependent variable occurring. For example, consider the relationship between race and sentence type (
					<xref ref-type="table" rid="t2a">Table 2A</xref>).
				</p>
				<table-wrap id="t2a">
					<label>Table 2A</label>
					<caption>
						<title>Sentence type by color</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="25%">
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Sentence</th>
								<th align="center" valign="top">Black</th>
								<th align="center" valign="top">Non-black</th>
								<th align="center" valign="top">Total</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">Death penalty</td>
								<td align="center" valign="top">28</td>
								<td align="center" valign="top">22</td>
								<td align="center" valign="top">50</td>
							</tr>
							<tr>
								<td align="left" valign="top">Life in prison</td>
								<td align="center" valign="top">45</td>
								<td align="center" valign="top">52</td>
								<td align="center" valign="top">97</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">
									<bold>73</bold>
								</td>
								<td align="center" valign="top">
									<bold>74</bold>
								</td>
								<td align="center" valign="top">
									<bold>147</bold>
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Source: Schwab (2002).</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>It is possible, then, to calculate the chance for each specific group: black people and non-black people. For black people, we have 28/45 = 0,622. For non-black people, we have 22/52 = 0,423. The impact of being black can be represented by the division of a black person receiving the death penalty and a non-black person receiving capital punishment (0,423). 0,622/0,423 = 1,47. For the interpretation: a) black people have 1,47 higher chance of receiving the death penalty than non-black people; b) being black increases by 47% the chances of receiving capital punishment (1,47-1*100).</p>
				<sec>
					<title>Learning tools</title>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.icpsr.umich.edu/icpsrweb/sumprog/">http://www.icpsr.umich.edu/icpsrweb/sumprog/</ext-link>
					</p>
					<p>Internationally, the 
						<italic>Summer Program in Quantitative Methods of Social Research</italic> (ICPRS) is one of the main initiatives in the dissemination of research methods and techniques.
					</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.fafich.ufmg.br/~mq/index.html">http://www.fafich.ufmg.br/~mq/index.html</ext-link>
					</p>
					<p>Intensive course in Quantitative Methodology in the Humanities. It is the most traditional course in teaching of research methods and techniques in Social Sciences in Brazil.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://summerschool.ipsa.org/">http://summerschool.ipsa.org/</ext-link>
					</p>
					<p>Summer school organized by the International Political Science Association, the Department of Political Science, and the Institute for International Relations of the University of S&#x00E3;o Paulo (USP).</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://gking.harvard.edu/">http://gking.harvard.edu/</ext-link>
					</p>
					<p>Gary King shares papers on methodology, specific software, and databases for researchers interested in replication.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://faculty.chass.ncsu.edu/garson/PA765/statnote.htm">http://faculty.chass.ncsu.edu/garson/PA765/statnote.htm</ext-link>
					</p>
					<p>David Garson presents different topics in multivariate statistics, using the Statistical Package for Social Sciences. At the end of each section, there is a suggested bibliography that can be used as reference to gain more in-depth knowledge on the topic.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.statsoft.com/textbook/">http://www.statsoft.com/textbook/</ext-link>
					</p>
					<p>Has different multivariate techniques using the software Stastistica.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.ats.ucla.edu/stat/">http://www.ats.ucla.edu/stat/</ext-link>
					</p>
					<p>Website for the University of California (UCLA) specialized in multivariate techniques. Here, the user finds applications for different software (SAS, SPSS, STATA, R, etc.), including video-classes and tutorials.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.socr.ucla.edu/SOCR.html">http://www.socr.ucla.edu/SOCR.html</ext-link>
					</p>
					<p>At this address, the reader finds games, applications, analyses, among other tools related to teaching Statistics and different research techniques.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://pan.oxfordjournals.org/">http://pan.oxfordjournals.org/</ext-link>
					</p>
					<p>Political Analysis is one of the most influential journals in contemporary Political Science and publishes papers in the field of methodology.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.amstat.org/publications/jse/">http://www.amstat.org/publications/jse/</ext-link>
					</p>
					<p>Journal specialized in the publication of teaching and learning techniques in Statistics.</p>
					<p>
						<ext-link ext-link-type="uri" xlink:href="http://www.politicahoje.ufpe.br/index.php/politica">http://www.politicahoje.ufpe.br/index.php/politica</ext-link>
					</p>
					<p>The journal Pol&#x00ED;tica Hoje, from UFPE&#x2019;s Department of Political Science, recently published a special issue dedicated to Methodology and Epistemology in Political Science and International Relations.</p>
				</sec>
			</app>
		</app-group>
	</back>
	<!--sub-article article-type="translation" id="S01" xml:lang="pt">
		<front-stub>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Artigos Originais</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Leia este artigo se voc&#x00EA; quiser aprender regress&#x00E3;o log&#x00ED;stica</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-0249-517X</contrib-id>
					<name>
						<surname>Fernandes</surname>
						<given-names>Ant&#x00F4;nio Alves T&#x00F4;rres</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0001-6982-2262</contrib-id>
					<name>
						<surname>Figueiredo</surname>
						<given-names>Dalson Britto</given-names>
						<suffix>Filho</suffix>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0003-1717-523X</contrib-id>
					<name>
						<surname>Rocha</surname>
						<given-names>Enivaldo Carvalho da</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">
						<sup>I</sup>
					</xref>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-2257-8108</contrib-id>
					<name>
						<surname>Nascimento</surname>
						<given-names>Willber da Silva</given-names>
					</name>
					<xref ref-type="aff" rid="aff2">
						<sup>I</sup>
					</xref>
				</contrib>
				<aff id="aff2">
					<sup>I</sup>
					<institution content-type="normalized">Universidade Federal de Pernambuco</institution>
					<institution content-type="orgname">Universidade Federal de Pernambuco</institution>
					<institution content-type="orgdiv1">Programa de P&#x00F3;s-Gradua&#x00E7;&#x00E3;o em Ci&#x00EA;ncia Pol&#x00ED;tica</institution>
					<addr-line>
						<named-content content-type="city">Recife</named-content>
						<named-content content-type="state">PE</named-content>
					</addr-line>
					<country country="BR">Brazil</country>
					<institution content-type="original">IPrograma de P&#x00F3;s-Gradua&#x00E7;&#x00E3;o em Ci&#x00EA;ncia Pol&#x00ED;tica, Universidade Federal de Pernambuco, Recife, PE, Brasil.</institution>
				</aff>
			</contrib-group>
			<pub-date date-type="pub" publication-format="electronic">
				<day>19</day>
				<month>12</month>
				<year>2020</year>
			</pub-date>
			<pub-date date-type="collection" publication-format="electronic">
				<season>Oct-Dec</season>
				<year>2020</year>
			</pub-date>
			<volume>28</volume>
			<issue>74</issue>
			<elocation-id>006b</elocation-id>
			<history>
				<date date-type="received">
					<day>19</day>
					<month>10</month>
					<year>2019</year>
				</date>
				<date date-type="accepted">
					<day>07</day>
					<month>05</month>
					<year>2020</year>
				</date>
				<date date-type="received">
					<day>16</day>
					<month>05</month>
					<year>2020</year>
				</date>
			</history>
			<permissions>
				<copyright-year>2020</copyright-year>
				<license xml:lang="en" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by-nc/4.0">
					<license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License which permits unrestricted non-commercial use, distribution, and reproduction in any medium provided the original work is properly cited.</license-p>
				</license>
			</permissions>
			<abstract>
				<title>RESUMO</title>
				<sec>
					<title>Introdu&#x00E7;&#x00E3;o:</title>
					<p>E se a minha vari&#x00E1;vel resposta for categ&#x00F3;rica bin&#x00E1;ria? Este artigo apresenta uma introdu&#x00E7;&#x00E3;o intuitiva &#x00E0; regress&#x00E3;o log&#x00ED;stica, t&#x00E9;cnica estat&#x00ED;stica mais adequada para lidar com vari&#x00E1;veis dependentes dicot&#x00F4;micas.</p>
				</sec>
				<sec>
					<title>Materiais e M&#x00E9;todos:</title>
					<p>estimamos o efeito dos esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o sobre a chance de reelei&#x00E7;&#x00E3;o de candidatos concorrentes a deputado federal no Brasil a partir dos dados de Castro e Nunes (2014). Em particular, mostramos a implementa&#x00E7;&#x00E3;o computacional no R e explicamos a interpreta&#x00E7;&#x00E3;o substantiva dos resultados.</p>
				</sec>
				<sec>
					<title>Resultados:</title>
					<p>disponibilizamos todos os materiais de replica&#x00E7;&#x00E3;o, permitindo que estudantes e profissionais utilizem os procedimentos discutidos aqui em suas atividades de estudo e pesquisa.</p>
				</sec>
				<sec>
					<title>Discuss&#x00E3;o:</title>
					<p>esperamos incentivar o uso da regress&#x00E3;o log&#x00ED;stica e difundir a replicabilidade como ferramenta de ensino de an&#x00E1;lise de dados.</p>
				</sec>
			</abstract>
			<kwd-group xml:lang="pt">
				<title>PALAVRAS-CHAVE:</title>
				<kwd>regress&#x00E3;o</kwd>
				<kwd>regress&#x00E3;o log&#x00ED;stica</kwd>
				<kwd>replica&#x00E7;&#x00E3;o</kwd>
				<kwd>m&#x00E9;todos quantitativos</kwd>
				<kwd>transpar&#x00EA;ncia</kwd>
			</kwd-group>
		</front-stub>
		<body>
			<sec sec-type="intro">
				<title>I. Introdu&#x00E7;&#x00E3;o
					<xref ref-type="fn" rid="fn34">
						<sup>1</sup>
					</xref>
				</title>
				<p>O modelo linear de m&#x00ED;nimos quadrados ordin&#x00E1;rios (MQO) &#x00E9; uma das ferramentas mais utilizadas na Ci&#x00EA;ncia Pol&#x00ED;tica (
					<xref ref-type="bibr" rid="B41">Kruger &amp; Lewis-Beck, 2008)</xref>. Desde que os seus pressupostos sejam respeitados, os coeficientes estimados a partir de uma amostra aleat&#x00F3;ria fornecem a Melhor Estimativa Linear N&#x00E3;o Viesada (
					<italic>best linear unbiased estimator)</italic> dos par&#x00E2;metros populacionais (
					<xref ref-type="bibr" rid="B35">Kennedy, 2005</xref>). N&#x00E3;o viesada porque nem sobreestima nem subestima sistematicamente o valor do par&#x00E2;metro, e melhor porque apresenta a menor vari&#x00E2;ncia dentre todas as poss&#x00ED;veis estimativas (
					<xref ref-type="bibr" rid="B43">Lewis-Beck, 1980</xref>).
				</p>
				<p>E quando os pressupostos forem violados? Nesse caso, devemos adotar t&#x00E9;cnicas mais adequadas &#x00E0; natureza dos dados. Por exemplo, imagine uma pesquisa que investiga o impacto da receita de campanha sobre a chance de um candidato ser eleito ou n&#x00E3;o. Como a vari&#x00E1;vel dependente &#x00E9; bin&#x00E1;ria, alguns pressupostos do modelo de m&#x00ED;nimos quadrados s&#x00E3;o violados (homocedasticidade, linearidade e normalidade) e as estimativas podem ser inconsistentes. A regress&#x00E3;o log&#x00ED;stica &#x00E9; a melhor ferramenta para lidar com vari&#x00E1;veis dependentes dicot&#x00F4;micas, ou seja, quando o 
					<italic>y</italic> apenas pode assumir duas categorias: eleito ou n&#x00E3;o eleito; adotou a pol&#x00ED;tica ou n&#x00E3;o adotou; votou no presidente Bolsonaro ou n&#x00E3;o. Lottes, DeMaris e Adler (1996) argumentam que, apesar da popularidade da regress&#x00E3;o log&#x00ED;stica nas Ci&#x00EA;ncias Sociais, ainda existe grande confus&#x00E3;o a respeito de sua correta utiliza&#x00E7;&#x00E3;o. Pela nossa experi&#x00EA;ncia pedag&#x00F3;gica, essa dificuldade se explica pela escassez de material did&#x00E1;tico intuitivo. Al&#x00E9;m disso, muitos cursos de gradua&#x00E7;&#x00E3;o e p&#x00F3;s-gradua&#x00E7;&#x00E3;o, assim como livros did&#x00E1;ticos, encerram o conte&#x00FA;do em regress&#x00E3;o linear, o que reduz a dissemina&#x00E7;&#x00E3;o de outras t&#x00E9;cnicas de an&#x00E1;lise de dados.
				</p>
				<p>Para preencher essa lacuna, este artigo apresenta uma introdu&#x00E7;&#x00E3;o &#x00E0; regress&#x00E3;o log&#x00ED;stica. Nosso objetivo &#x00E9; facilitar a compreens&#x00E3;o da aplica&#x00E7;&#x00E3;o pr&#x00E1;tica dessa t&#x00E9;cnica. Em termos de audi&#x00EA;ncia, escrevemos para estudantes em est&#x00E1;gios iniciais de treinamento e professores que necessitam de materiais did&#x00E1;ticos para ministrar disciplinas de m&#x00E9;todos quantitativos. Metodologicamente, reproduzimos os dados de Castro e Nunes (2014) sobre a rela&#x00E7;&#x00E3;o entre envolvimento em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o (Mensal&#x00E3;o
					<xref ref-type="fn" rid="fn35">
						<sup>2</sup>
					</xref> e Sanguessugas
					<xref ref-type="fn" rid="fn36">
						<sup>3</sup>
					</xref>) e a chance de reelei&#x00E7;&#x00E3;o dos candidatos concorrentes ao cargo de deputado federal no Brasil em 2006. Todos os dados e 
					<italic>scripts</italic> est&#x00E3;o dispon&#x00ED;veis no s&#x00ED;tio eletr&#x00F4;nico do 
					<italic>Open Science Framework</italic> (OSF)
					<xref ref-type="fn" rid="fn37">
						<sup>4</sup>
					</xref>.
				</p>
				<p>Ao final, o leitor deve ser capaz de identificar quando a regress&#x00E3;o log&#x00ED;stica deve ser utilizada, implementar computacionalmente o modelo e interpretar os resultados. Estamos cientes de que este trabalho n&#x00E3;o substitui a leitura detalhada das fontes prim&#x00E1;rias sobre o assunto e de materiais mais t&#x00E9;cnicos. Apesar disso, esperamos facilitar a compreens&#x00E3;o da regress&#x00E3;o log&#x00ED;stica e disseminar a replicabilidade como ferramenta de ensino de an&#x00E1;lise de dados.</p>
				<p>O restante do trabalho est&#x00E1; dividido da seguinte forma: a pr&#x00F3;xima se&#x00E7;&#x00E3;o explica os fundamentos da regress&#x00E3;o log&#x00ED;stica. A terceira parte identifica os principais requisitos t&#x00E9;cnicos que devem ser satisfeitos para garantir que as estimativas do modelo sejam consistentes. A quarta se&#x00E7;&#x00E3;o descreve as principais estat&#x00ED;sticas que devem ser observadas. Por fim, apresentamos algumas recomenda&#x00E7;&#x00F5;es sobre como melhorar a qualidade do treinamento metodol&#x00F3;gico oferecido aos alunos da gradua&#x00E7;&#x00E3;o e p&#x00F3;s-gradua&#x00E7;&#x00E3;o em Ci&#x00EA;ncia Pol&#x00ED;tica no Brasil.</p>
			</sec>
			<sec>
				<title>II. A l&#x00F3;gica da regress&#x00E3;o log&#x00ED;stica
					<xref ref-type="fn" rid="fn38">
						<sup>5</sup>
					</xref>
				</title>
				<p>A utiliza&#x00E7;&#x00E3;o de vari&#x00E1;veis dependentes categ&#x00F3;ricas bin&#x00E1;rias &#x00E9; comum na pesquisa emp&#x00ED;rica em Ci&#x00EA;ncia Pol&#x00ED;tica. Por exemplo: votou ou n&#x00E3;o (Nicolau, 2007; 
					<xref ref-type="bibr" rid="B63">Soares, 2000</xref>), venceu ou perdeu a disputa eleitoral (
					<xref ref-type="bibr" rid="B64">Speck &amp; Mancuso, 2013</xref>; 
					<xref ref-type="bibr" rid="B57">Peixoto, 2009</xref>), aderiu &#x00E0; pol&#x00ED;tica p&#x00FA;blica ou n&#x00E3;o (
					<xref ref-type="bibr" rid="B21">Furlong, 1998</xref>), democracia ou n&#x00E3;o democracia (
					<xref ref-type="bibr" rid="B24">Goldsmith, Chalup &amp; Quinlan, 2008</xref>), iniciou guerra ou n&#x00E3;o (
					<xref ref-type="bibr" rid="B28">Henderson &amp; Singer, 2000</xref>), recorreu ou n&#x00E3;o de uma decis&#x00E3;o judicial (
					<xref ref-type="bibr" rid="B11">Epstein, Landes &amp; Posner, 2013</xref>). Para todas essas situa&#x00E7;&#x00F5;es a regress&#x00E3;o log&#x00ED;stica &#x00E9; a t&#x00E9;cnica mais adequada para modelar a varia&#x00E7;&#x00E3;o da vari&#x00E1;vel dependente em fun&#x00E7;&#x00E3;o de um conjunto de vari&#x00E1;veis independentes.
				</p>
				<p>Na regress&#x00E3;o log&#x00ED;stica a vari&#x00E1;vel dependente tem apenas duas categorias
					<xref ref-type="fn" rid="fn39">
						<sup>6</sup>
					</xref>. Em geral, a ocorr&#x00EA;ncia do evento de interesse &#x00E9; codificada como &#x201C;1&#x201D; e a aus&#x00EA;ncia como &#x201C;0&#x201D;. Lembrando que a codifica&#x00E7;&#x00E3;o altera o sinal dos coeficientes e, portanto, sua interpreta&#x00E7;&#x00E3;o substantiva. Para melhor entender o funcionamento da regress&#x00E3;o log&#x00ED;stica &#x00E9; necess&#x00E1;rio compreender a l&#x00F3;gica da an&#x00E1;lise de regress&#x00E3;o de forma geral. Vejamos a nota&#x00E7;&#x00E3;o cl&#x00E1;ssica do modelo linear:
				</p>
				<disp-formula id="eq3">
					<label>(1)</label>
					<mml:math id="m3" display="block">
						<mml:mrow>
							<mml:mtext>Y</mml:mtext>
							<mml:mo>=</mml:mo>
							<mml:mtext>&#x03B1;</mml:mtext>
							<mml:mo>+</mml:mo>
							<mml:mtext>&#x03B2;X</mml:mtext>
							<mml:mo>+</mml:mo>
							<mml:mtext>&#x03B5;</mml:mtext>
						</mml:mrow>
					</mml:math>
				</disp-formula>
				<p>Y representa a vari&#x00E1;vel dependente, ou seja, aquilo que queremos entender/explicar/predizer. X representa a vari&#x00E1;vel independente. O intercepto, (&#x03B1;), representa o valor de Y quando X assume valor zero. O coeficiente de regress&#x00E3;o, (&#x03B2;), representa a varia&#x00E7;&#x00E3;o observada em Y associada ao aumento de uma unidade em X. O termo estoc&#x00E1;stico, (&#x03B5;), representa o erro do modelo. Tecnicamente, &#x00E9; poss&#x00ED;vel estimar se existe rela&#x00E7;&#x00E3;o linear entre uma vari&#x00E1;vel dependente (Y) e diferentes vari&#x00E1;veis independentes. Al&#x00E9;m disso, o modelo permite observar a magnitude do efeito e testar a signific&#x00E2;ncia estat&#x00ED;stica dos coeficientes (p-valor e intervalos de confian&#x00E7;a). A regress&#x00E3;o log&#x00ED;stica pode ser interpretada como um caso particular de modelos lineares generalizados (MLG)
					<xref ref-type="fn" rid="fn40">
						<sup>7</sup>
					</xref> em que a vari&#x00E1;vel dependente &#x00E9; dicot&#x00F4;mica. A 
					<xref ref-type="fig" rid="f6">Figura 1</xref> compara os modelos linear e log&#x00ED;stico.
				</p>
				<fig id="f6">
					<label>Figura 1</label>
					<caption>
						<title>Reta de regress&#x00E3;o linear versus curva log&#x00ED;stica</title>
					</caption>
					<graphic xlink:href="0104-4478-rsocp-28-74-e006-gf06.tif"/>
					<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria, com base em Hair, A. 
						<italic>et al.</italic> (2019).
					</attrib>
				</fig>
				<p>Como a vari&#x00E1;vel dependente no modelo log&#x00ED;stico assume apenas dois valores (0 ou 1), a probabilidade predita pelo modelo tamb&#x00E9;m deve se limitar ao referido intervalo. Quando X (vari&#x00E1;vel independente) assume valores mais baixos, a probabilidade se aproxima de zero. No outro oposto, na medida em que X aumenta, a probabilidade se aproxima de um. Para Kleibaum e Klein (2010), o fato de que a fun&#x00E7;&#x00E3;o log&#x00ED;stica varia entre 0 e 1 explica a popularidade desse modelo. Isso porque como a natureza bin&#x00E1;ria da vari&#x00E1;vel dependente viola alguns pressupostos do modelo linear (homocedasticidade
					<xref ref-type="fn" rid="fn41">
						<sup>8</sup>
					</xref>, linearidade
					<xref ref-type="fn" rid="fn42">
						<sup>9</sup>
					</xref>, normalidade), a utiliza&#x00E7;&#x00E3;o do modelo linear para analisar vari&#x00E1;veis bin&#x00E1;rias pode gerar coeficientes ineficientes e viesados
					<xref ref-type="fn" rid="fn43">
						<sup>10</sup>
					</xref>. Para melhor compreender a rela&#x00E7;&#x00E3;o entre os modelos linear e log&#x00ED;stico reproduzimos os dados de Hosmer, Lemeshow e Sturdivant (2013) sobre a associa&#x00E7;&#x00E3;o entre idade e doen&#x00E7;as coron&#x00E1;rias (
					<xref ref-type="fig" rid="f7">Gr&#x00E1;fico 1</xref>)
					<xref ref-type="fn" rid="fn44">
						<sup>11</sup>
					</xref>.
				</p>
				<fig id="f7">
					<label>Gr&#x00E1;fico 1</label>
					<caption>
						<title>Idade x doen&#x00E7;a coron&#x00E1;ria</title>
					</caption>
					<graphic xlink:href="0104-4478-rsocp-28-74-e006-gf07.tif"/>
					<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria, com base em Hosmer, Lemeshow e Sturdivant (2013).</attrib>
				</fig>
				<p>A linha pontilhada vertical representa a m&#x00E9;dia da idade: 44,38 anos. Os casos foram codificados como 1 (desenvolveu doen&#x00E7;a coron&#x00E1;ria) e 0 (n&#x00E3;o desenvolveu). A tend&#x00EA;ncia &#x00E9; bastante clara: a medida em que a idade aumenta, cresce a quantidade de pessoas diagnosticadas com doen&#x00E7;as coron&#x00E1;rias. Uma forma intuitiva de observar esse padr&#x00E3;o &#x00E9; examinar o quantitativo de casos tomando a m&#x00E9;dia como par&#x00E2;metro de compara&#x00E7;&#x00E3;o. Por exemplo, para as pessoas acima da m&#x00E9;dia, existem mais casos de doentes, enquanto para as pessoas abaixo da m&#x00E9;dia, a concentra&#x00E7;&#x00E3;o maior &#x00E9; na categoria &#x201C;n&#x00E3;o desenvolveu&#x201D;. Ou seja, o gr&#x00E1;fico est&#x00E1; informando que existe uma associa&#x00E7;&#x00E3;o entre idade e doen&#x00E7;a coron&#x00E1;ria. &#x00C9; nesse sentido que a regress&#x00E3;o log&#x00ED;stica informa a probabilidade da ocorr&#x00EA;ncia do evento que foi codificado como 1, no caso, desenvolveu doen&#x00E7;a coron&#x00E1;ria. A 
					<xref ref-type="table" rid="t10">Tabela 1</xref> apresenta esses dados por grupo de idade.
				</p>
				<table-wrap id="t10">
					<label>Tabela 1</label>
					<caption>
						<title>Grupo de idade x doen&#x00E7;a coron&#x00E1;ria</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="20%">
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top"></th>
								<th align="center" valign="top"></th>
								<th align="center" valign="top" colspan="2">
									<bold>Doen&#x00E7;a</bold>
								</th>
								<th align="center" valign="top"></th>
							</tr>
							<tr>
								<th align="left" valign="top">
									<bold>Grupo Idade</bold>
								</th>
								<th align="center" valign="top">
									<bold>N</bold>
								</th>
								<th align="center" valign="top">
									<bold>Sim</bold>
								</th>
								<th align="center" valign="top">
									<bold>N&#x00E3;o</bold>
								</th>
								<th align="center" valign="top">
									<bold>Sim (%)</bold>
								</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">20-29</td>
								<td align="center" valign="top">10</td>
								<td align="center" valign="top">1</td>
								<td align="center" valign="top">9</td>
								<td align="center" valign="top">0,1</td>
							</tr>
							<tr>
								<td align="left" valign="top">30-34</td>
								<td align="center" valign="top">15</td>
								<td align="center" valign="top">2</td>
								<td align="center" valign="top">13</td>
								<td align="center" valign="top">0,13</td>
							</tr>
							<tr>
								<td align="left" valign="top">35-39</td>
								<td align="center" valign="top">12</td>
								<td align="center" valign="top">3</td>
								<td align="center" valign="top">9</td>
								<td align="center" valign="top">0,25</td>
							</tr>
							<tr>
								<td align="left" valign="top">40-44</td>
								<td align="center" valign="top">15</td>
								<td align="center" valign="top">5</td>
								<td align="center" valign="top">10</td>
								<td align="center" valign="top">0,33</td>
							</tr>
							<tr>
								<td align="left" valign="top">45-49</td>
								<td align="center" valign="top">13</td>
								<td align="center" valign="top">6</td>
								<td align="center" valign="top">7</td>
								<td align="center" valign="top">0,46</td>
							</tr>
							<tr>
								<td align="left" valign="top">50-54</td>
								<td align="center" valign="top">8</td>
								<td align="center" valign="top">5</td>
								<td align="center" valign="top">3</td>
								<td align="center" valign="top">0,63</td>
							</tr>
							<tr>
								<td align="left" valign="top">55-59</td>
								<td align="center" valign="top">17</td>
								<td align="center" valign="top">13</td>
								<td align="center" valign="top">4</td>
								<td align="center" valign="top">0,76</td>
							</tr>
							<tr>
								<td align="left" valign="top">60-69</td>
								<td align="center" valign="top">10</td>
								<td align="center" valign="top">8</td>
								<td align="center" valign="top">2</td>
								<td align="center" valign="top">0,8</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">
									<bold>100</bold>
								</td>
								<td align="center" valign="top">
									<bold>43</bold>
								</td>
								<td align="center" valign="top">
									<bold>57</bold>
								</td>
								<td align="center" valign="top"></td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria, com base em &nbsp;Hosmer, Lemeshow e Sturdivant (2013).</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>Basta observar a &#x00FA;ltima coluna para chegar &#x00E0; mesma conclus&#x00E3;o apresentada pelo 
					<xref ref-type="fig" rid="f7">Gr&#x00E1;fico 1</xref>: quanto maior a idade, maior &#x00E9; a chance de desenvolver doen&#x00E7;as coron&#x00E1;rias. Uma op&#x00E7;&#x00E3;o adicional para visualizar a rela&#x00E7;&#x00E3;o entre essas vari&#x00E1;veis &#x00E9; representar graficamente o percentual de doentes para cada grupo de idade (
					<xref ref-type="fig" rid="f8">Gr&#x00E1;fico 2</xref>).
				</p>
				<fig id="f8">
					<label>Gr&#x00E1;fico 2</label>
					<caption>
						<title>Grupo de idade x doen&#x00E7;a coron&#x00E1;ria</title>
					</caption>
					<graphic xlink:href="0104-4478-rsocp-28-74-e006-gf08.tif"/>
					<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria, com base em Hosmer, Lemeshow e Sturdivant (2013).</attrib>
				</fig>
				<p>Observamos uma correla&#x00E7;&#x00E3;o positiva entre idade (eixo X) e a probabilidade de desenvolver doen&#x00E7;as card&#x00ED;acas (eixo Y). A regress&#x00E3;o log&#x00ED;stica vai informar a dire&#x00E7;&#x00E3;o, a magnitude e o n&#x00ED;vel da signific&#x00E2;ncia estat&#x00ED;stica dessa rela&#x00E7;&#x00E3;o. Em s&#x00ED;ntese, o pesquisador deve utilizar a regress&#x00E3;o log&#x00ED;stica quando a vari&#x00E1;vel dependente for categ&#x00F3;rica bin&#x00E1;ria. Dado que muitas vari&#x00E1;veis em Ci&#x00EA;ncias Humanas s&#x00E3;o categ&#x00F3;ricas, os benef&#x00ED;cios anal&#x00ED;ticos associados &#x00E0; correta aplica&#x00E7;&#x00E3;o e interpreta&#x00E7;&#x00E3;o do modelo log&#x00ED;stico s&#x00E3;o evidentes
					<xref ref-type="fn" rid="fn45">
						<sup>12</sup>
					</xref>.
				</p>
			</sec>
			<sec>
				<title>III. Planejando uma regress&#x00E3;o log&#x00ED;stica</title>
				<p>A 
					<xref ref-type="table" rid="t11">Tabela 2</xref> descreve os cinco est&#x00E1;gios que devem ser observados.
				</p>
				<table-wrap id="t11">
					<label>Tabela 2</label>
					<caption>
						<title>Planejamento de uma regress&#x00E3;o log&#x00ED;stica em cinco est&#x00E1;gios</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="50%">
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Est&#x00E1;gio</th>
								<th align="left" valign="top">Descri&#x00E7;&#x00E3;o</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">1&#x00BA;</td>
								<td align="left" valign="top">Identificar a vari&#x00E1;vel dependente</td>
							</tr>
							<tr>
								<td align="left" valign="top">2&#x00BA;</td>
								<td align="left" valign="top">Observar os requisitos t&#x00E9;cnicos</td>
							</tr>
							<tr>
								<td align="left" valign="top">3&#x00BA;</td>
								<td align="left" valign="top">Estimar e ajustar o modelo</td>
							</tr>
							<tr>
								<td align="left" valign="top">4&#x00BA;</td>
								<td align="left" valign="top">Interpretar os resultados</td>
							</tr>
							<tr>
								<td align="left" valign="top">5&#x00BA;</td>
								<td align="left" valign="top">Validar os resultados</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria a partir de 
							<xref ref-type="bibr" rid="B27">Hair 
								<italic>et al.</italic> (2009)
							</xref>.
						</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>O primeiro passo &#x00E9; identificar uma quest&#x00E3;o de pesquisa em que a vari&#x00E1;vel dependente seja originalmente dicot&#x00F4;mica. Por exemplo, dada a popularidade da regress&#x00E3;o log&#x00ED;stica na &#x00E1;rea de sa&#x00FA;de, uma vari&#x00E1;vel tipicamente utilizada &#x00E9;: viveu/morreu; doente/n&#x00E3;o doente; fumante/n&#x00E3;o fumante. Em geral, o pesquisador deve ser dissuadido de recodificar uma vari&#x00E1;vel cont&#x00ED;nua ou discreta em uma vari&#x00E1;vel categ&#x00F3;rica dicot&#x00F4;mica. Para explicar, suponha que a vari&#x00E1;vel de interesse &#x00E9; renda per capita. &#x00C9; errado recodificar a renda com o objetivo de produzir duas categorias: rico versus pobre. Tecnicamente, a recodifica&#x00E7;&#x00E3;o de uma vari&#x00E1;vel quantitativa em categ&#x00F3;rica implica em perda de informa&#x00E7;&#x00E3;o e isso reduz a consist&#x00EA;ncia das estimativas (
					<xref ref-type="bibr" rid="B12">Fernandes 
						<italic>et al.</italic>, 2019
					</xref>)
					<xref ref-type="fn" rid="fn46">
						<sup>13</sup>
					</xref>.
				</p>
				<p>No segundo est&#x00E1;gio, deve-se observar os requisitos t&#x00E9;cnicos. Apesar de ser mais flex&#x00ED;vel do que outras t&#x00E9;cnicas estat&#x00ED;sticas, a regress&#x00E3;o log&#x00ED;stica &#x00E9; sens&#x00ED;vel, por exemplo, a problemas de multicolinearidade (altos n&#x00ED;veis de correla&#x00E7;&#x00E3;o entre as vari&#x00E1;veis independentes)
					<xref ref-type="fn" rid="fn47">
						<sup>14</sup>
					</xref>. Existem diferentes procedimentos para minimizar esse problema. O mais simples &#x00E9; aumentar o n&#x00FA;mero de observa&#x00E7;&#x00F5;es (
					<xref ref-type="bibr" rid="B35">Kennedy, 2005</xref>). Uma sa&#x00ED;da adicional &#x00E9; utilizar alguma t&#x00E9;cnica de redu&#x00E7;&#x00E3;o de dados para criar uma medida s&#x00ED;ntese a partir da vari&#x00E2;ncia das vari&#x00E1;veis originais. N&#x00E3;o devemos simplesmente excluir uma das vari&#x00E1;veis independentes, sob pena de produzir erros de especifica&#x00E7;&#x00E3;o no modelo. Na regress&#x00E3;o log&#x00ED;stica o tamanho da amostra &#x00E9; fundamental (Hair 
					<italic>et al.</italic>, 2009). Amostras pequenas tendem a produzir estimativas inconsistentes. Por outro lado, amostras excessivamente grandes aumentam o poder dos testes estat&#x00ED;sticos de tal sorte que qualquer efeito tende a ser estatisticamente significativo, independentemente da magnitude. Hosmer e Lemeshow (2000) sugerem um 
					<italic>n</italic> m&#x00ED;nimo de 400 casos. 
					<xref ref-type="bibr" rid="B27">Hair 
						<italic>et al.</italic> (2009)
					</xref> sugerem uma raz&#x00E3;o de 10 casos para cada vari&#x00E1;vel independente inclu&#x00ED;da no modelo. 
					<xref ref-type="bibr" rid="B56">Pedhazur (1982)</xref> recomenda uma raz&#x00E3;o de 30 casos para cada par&#x00E2;metro estimado.
				</p>
				<p>Outra eventual fonte de problema s&#x00E3;o os 
					<italic>outliers</italic>. Os casos extremos produzem resultados desastrosos em an&#x00E1;lise de dados e no caso da regress&#x00E3;o log&#x00ED;stica a presen&#x00E7;a de observa&#x00E7;&#x00F5;es at&#x00ED;picas pode prejudicar o ajuste do modelo. Uma vez detectados os casos aberrantes o pesquisador deve decidir o que fazer com eles. Algumas vezes um caso extremo n&#x00E3;o passa de um erro de digita&#x00E7;&#x00E3;o, o que pode ser facilmente resolvido. Uma op&#x00E7;&#x00E3;o &#x00E9; excluir os 
					<italic>outliers</italic> da estima&#x00E7;&#x00E3;o do modelo e mensurar o impacto de sua inclus&#x00E3;o sobre os coeficientes. Outro procedimento comumente adotado &#x00E9; recodificar o caso, imputando-lhe um valor menos extremo, a m&#x00E9;dia por exemplo. De toda forma, &#x00E9; importante descrever detalhadamente o que foi feito para lidar com eventuais observa&#x00E7;&#x00F5;es extremas
					<xref ref-type="fn" rid="fn48">
						<sup>15</sup>
					</xref>.
				</p>
				<p>No terceiro est&#x00E1;gio, o pesquisador deve estimar o modelo. Aqui dois procedimentos s&#x00E3;o essenciais: a) reportar o 
					<italic>software</italic> e b) compartilhar os materiais de replica&#x00E7;&#x00E3;o, que incluem os dados originais, os dados tratados e os 
					<italic>scripts</italic> computacionais
					<xref ref-type="fn" rid="fn49">
						<sup>16</sup>
					</xref>. Esses procedimentos aumentam a transpar&#x00EA;ncia e facilitam replicablidade dos resultados (
					<xref ref-type="bibr" rid="B38">King, 1995</xref>; 
					<xref ref-type="bibr" rid="B54">Paranhos 
						<italic>et al.</italic>, 2013
					</xref>; 
					<xref ref-type="bibr" rid="B33">Janz, 2016</xref>; 
					<xref ref-type="bibr" rid="B12">Figueiredo Filho 
						<italic>et al.</italic>, 2019
					</xref>). Depois de estimar o modelo, o pr&#x00F3;ximo passo &#x00E9; avaliar a qualidade do ajuste. Isso pode ser feito a partir da compara&#x00E7;&#x00E3;o do modelo nulo (apenas intercepto) com o modelo que incorpora as vari&#x00E1;veis independentes. Uma diferen&#x00E7;a estatisticamente significativa entre os modelos indica que as vari&#x00E1;veis explicativas ajudam a prever a ocorr&#x00EA;ncia da vari&#x00E1;vel dependente. A 
					<xref ref-type="fig" rid="f9">Figura 2</xref> mostra a l&#x00F3;gica subjacente &#x00E0; compara&#x00E7;&#x00E3;o de modelos quando lidamos com a regress&#x00E3;o log&#x00ED;stica.
				</p>
				<fig id="f9">
					<label>Figura 2</label>
					<caption>
						<title>Comparando o ajuste dos modelos log&#x00ED;sticos</title>
					</caption>
					<graphic xlink:href="0104-4478-rsocp-28-74-e006-gf09.tif"/>
					<attrib>Fonte: 
						<xref ref-type="bibr" rid="B27">Hair 
							<italic>et al.</italic> (2009)
						</xref>.
					</attrib>
				</fig>
				<p>Comparativamente, o modelo B &#x00E9; mais bem ajustado do que o modelo A. Isso pode ser observado pela diferen&#x00E7;a na capacidade discriminat&#x00F3;ria. Enquanto o modelo A apresenta alta variabilidade, o modelo B &#x00E9; mais preciso. Para Tabachnick, Fidell e Ullman,</p>
				<disp-quote>
					<p>[...] &#x201C;a regress&#x00E3;o log&#x00ED;stica, assim como an&#x00E1;lise de frequ&#x00EA;ncia m&#x00FA;ltipla, pode ser utilizada para ajustar e comparar modelos. O modelo mais simples (e pior ajustado) inclui apenas a constante e nenhum dos preditores. O modelo mais complexo (e melhor ajustado) inclui a constante, todos os preditores e, talvez, intera&#x00E7;&#x00F5;es entre os preditores. Muitas vezes, no entanto, nem todos os preditores (e intera&#x00E7;&#x00F5;es) est&#x00E3;o relacionados com o resultado. O investigador usa testes de qualidade do ajuste para escolher o modelo que faz o melhor trabalho de previs&#x00E3;o com o menor n&#x00FA;mero de preditores&#x201D; (
						<xref ref-type="bibr" rid="B66">Tabachnick, Fidell &amp; Ullman, 2007</xref>, p.439).
					</p>
				</disp-quote>
				<p>O quarto est&#x00E1;gio consiste na interpreta&#x00E7;&#x00E3;o dos resultados. Infelizmente, muitos trabalhos se limitam a analisar a signific&#x00E2;ncia estat&#x00ED;stica das estimativas e n&#x00E3;o conferem aten&#x00E7;&#x00E3;o &#x00E0; magnitude dos coeficientes. Sugerimos que os pesquisadores interpretem os coeficientes e discutam substantivamente como os resultados se relacionam com a hip&#x00F3;tese de pesquisa. Diferente da regress&#x00E3;o linear, em que os coeficientes s&#x00E3;o f&#x00E1;ceis de interpretar, as estimativas produzidas no modelo log&#x00ED;stico s&#x00E3;o menos intuitivas
					<xref ref-type="fn" rid="fn50">
						<sup>17</sup>
					</xref>. Isso porque a transforma&#x00E7;&#x00E3;o 
					<italic>logit</italic> informa o efeito da vari&#x00E1;vel independente sobre a varia&#x00E7;&#x00E3;o do logaritmo natural da chance da vari&#x00E1;vel dependente. Por exemplo, ao se considerar um coeficiente de 0,6, espera-se um acr&#x00E9;scimo de 0,6 unidades no 
					<italic>logit</italic> de Y sempre que X aumenta uma unidade. A principal desvantagem dessa abordagem &#x00E9; a falta de inteligibilidade. Afirmar que a quantidade de 
					<italic>logit</italic> aumentou em 0,6 unidades &#x00E9; pouco intuitivo e n&#x00E3;o ajuda a entender a rela&#x00E7;&#x00E3;o entre as vari&#x00E1;veis.
				</p>
				<p>Uma segunda possibilidade &#x00E9; analisar o impacto das vari&#x00E1;veis independentes sobre a chance (
					<italic>odds)</italic> de Y. Para isso o pesquisador deve obter o exponencial do pr&#x00F3;prio coeficiente. Em nosso exemplo, o exponencial de 0,6&nbsp;=&nbsp;1,82. Isso significa que a cada unidade adicional em X, espera-se um aumento de 1,82 na chance de ocorr&#x00EA;ncia de Y, mantendo as demais vari&#x00E1;veis constantes
					<xref ref-type="fn" rid="fn51">
						<sup>18</sup>
					</xref>. O 
					<xref ref-type="fig" rid="f10">Gr&#x00E1;fico 3</xref> ilustra a distribui&#x00E7;&#x00E3;o de uma fun&#x00E7;&#x00E3;o exponencial de uma simula&#x00E7;&#x00E3;o em que x varia entre -5 e 5.
				</p>
				<fig id="f10">
					<label>Gr&#x00E1;fico 3</label>
					<caption>
						<title>Fun&#x00E7;&#x00E3;o exponencial</title>
					</caption>
					<graphic xlink:href="0104-4478-rsocp-28-74-e006-gf10.tif"/>
					<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria, com base em Hosmer, Lemeshow e Sturdivant (2013).</attrib>
				</fig>
				<p>Na regress&#x00E3;o log&#x00ED;stica, o exponencial de um valor positivo (+) produz um coeficiente maior do que 1. Contrariamente, um coeficiente negativo (-) retorna um Exp (&#x03B2;) menor do que 1. Um coeficiente de valor zero produz um Exp (&#x03B2;) igual a 1, indicando que a vari&#x00E1;vel independente n&#x00E3;o afeta a chance de ocorr&#x00EA;ncia da vari&#x00E1;vel dependente. Ent&#x00E3;o, anote a&#x00ED; no seu caderno: quanto mais distante o coeficiente estiver de um, independente da dire&#x00E7;&#x00E3;o, maior &#x00E9; o impacto de uma determinada vari&#x00E1;vel independente sobre a chance da ocorr&#x00EA;ncia do evento de interesse
					<xref ref-type="fn" rid="fn52">
						<sup>19</sup>
					</xref>.
				</p>
				<p>A terceira possibilidade &#x00E9; estimar o aumento percentual na chance de ocorr&#x00EA;ncia de Y. Para tanto, deve-se subtrair uma unidade do coeficiente de regress&#x00E3;o exponencializado e multiplicar o resultado por 100, no caso, (1,82-1 * 100). Temos ent&#x00E3;o que o aumento de uma unidade em X est&#x00E1; associado a um incremento de 82% na chance de ocorr&#x00EA;ncia de Y (
					<italic>ceteris paribus)</italic>. A interpreta&#x00E7;&#x00E3;o dos coeficientes da regress&#x00E3;o log&#x00ED;stica pode ficar um pouco mais complicada quando a chance &#x00E9; menor do que 1, ou seja, quando o coeficiente (&#x03B2;) &#x00E9; negativo. Uma solu&#x00E7;&#x00E3;o &#x00E9; inverter o coeficiente (1/valor do coeficiente) o que facilita a interpreta&#x00E7;&#x00E3;o. Por exemplo, um coeficiente de 0,639 quando invertido indica que quando a vari&#x00E1;vel independente diminui em uma unidade, espera-se um aumento m&#x00E9;dio de 1,56 na chance de ocorr&#x00EA;ncia da vari&#x00E1;vel dependente.
				</p>
				<p>Por fim, o pesquisador deve validar os resultados observados com uma sub-amostra de sua base de dados original. Esse procedimento confere maior confiabilidade aos resultados de pesquisa, principalmente quando se trabalha com amostras pequenas. De acordo com 
					<xref ref-type="bibr" rid="B27">Hair 
						<italic>et al.</italic> (2009)
					</xref>,
				</p>
				<disp-quote>
					<p>&#x201C;a abordagem mais comum para estabelecer a validade externa &#x00E9; que a avalia&#x00E7;&#x00E3;o da taxa geral de acerto seja utilizando uma amostra separada (amostra de 
						<italic>holdout</italic>) seja atrav&#x00E9;s de simula&#x00E7;&#x00F5;es (
						<italic>bootstrapping</italic>). Validade externa &#x00E9; ratificada quando a taxa de acertos da abordagem selecionada excede os padr&#x00F5;es de compara&#x00E7;&#x00E3;o que representam a precis&#x00E3;o preditiva esperada ao acaso&#x201D; (Hair 
						<italic>et al.</italic>, 2009, p. 330).
					</p>
				</disp-quote>
				<p>Infelizmente, esse procedimento raramente &#x00E9; utilizado pelos cientistas pol&#x00ED;ticos. Desconfiamos que a reduzida utiliza&#x00E7;&#x00E3;o da valida&#x00E7;&#x00E3;o se explica, parcialmente, pela falta de treinamento sobre as especificidades da regress&#x00E3;o log&#x00ED;stica. A pr&#x00F3;xima se&#x00E7;&#x00E3;o apresenta um exemplo aplicado da regress&#x00E3;o log&#x00ED;stica e explica como os resultados devem ser interpretados.</p>
			</sec>
			<sec>
				<title>IV. Um exemplo aplicado</title>
				<p>Para ilustrar a aplica&#x00E7;&#x00E3;o do modelo de regress&#x00E3;o log&#x00ED;stica replicamos os dados de Castro e Nunes (2014) sobre corrup&#x00E7;&#x00E3;o e reelei&#x00E7;&#x00E3;o
					<xref ref-type="fn" rid="fn53">
						<sup>20</sup>
					</xref>. Todavia, como o nosso foco &#x00E9; puramente metodol&#x00F3;gico n&#x00E3;o iremos explorar o significado substantivo das conclus&#x00F5;es reportadas pelos autores. Seguindo o planejamento da se&#x00E7;&#x00E3;o anterior, o primeiro passo &#x00E9; identificar a vari&#x00E1;vel dependente, qual seja: assume valor &#x201C;1&#x201D; para os candidatos que foram reeleitos em 2006 e valor &#x201C;0&#x201D; caso contr&#x00E1;rio
					<xref ref-type="fn" rid="fn54">
						<sup>21</sup>
					</xref>.
				</p>
				<p>O segundo passo &#x00E9; verificar os requisitos t&#x00E9;cnicos para a estima&#x00E7;&#x00E3;o da regress&#x00E3;o log&#x00ED;stica. Nessa etapa &#x00E9; importante observar a presen&#x00E7;a de eventuais 
					<italic>outliers</italic>, exist&#x00EA;ncia de alta correla&#x00E7;&#x00E3;o entre as vari&#x00E1;veis independentes e adequada quantidade de observa&#x00E7;&#x00F5;es. Por limita&#x00E7;&#x00E3;o de espa&#x00E7;o, reproduziremos apenas um dos modelos apresentados por Castro e Nunes (2014). Em particular, a amostra utilizada para estimar o modelo 5 da 
					<xref ref-type="table" rid="t6">Tabela 6</xref> possui um total de 217 observa&#x00E7;&#x00F5;es e uma propor&#x00E7;&#x00E3;o de 19 casos para cada vari&#x00E1;vel independente. N&#x00E3;o encontramos casos destoantes e o n&#x00ED;vel de correla&#x00E7;&#x00E3;o entre as vari&#x00E1;veis inclu&#x00ED;das no modelo &#x00E9; aceit&#x00E1;vel. Dessa forma, podemos seguir para a pr&#x00F3;xima fase.
				</p>
				<p>O terceiro est&#x00E1;gio consiste na estima&#x00E7;&#x00E3;o do modelo
					<xref ref-type="fn" rid="fn55">
						<sup>22</sup>
					</xref>:
				</p>
				<disp-formula id="eq4">
					<label>(2)</label>
					<mml:math id="m4" display="block">
						<mml:mrow>
							<mml:mtext>logit</mml:mtext>
							<mml:mo stretchy="false">(</mml:mo>
							<mml:mtext>Y</mml:mtext>
							<mml:mo stretchy="false">)</mml:mo>
							<mml:mo>=</mml:mo>
							<mml:mtext>&#x03B1;</mml:mtext>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>1</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>1</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>2</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>2</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>3</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>3</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>4</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>4</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>5</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>5</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>6</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>6</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>7</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>7</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>8</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>8</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mtext>9</mml:mtext>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mtext>9</mml:mtext>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mrow>
									<mml:mtext>10</mml:mtext>
								</mml:mrow>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mrow>
									<mml:mtext>10</mml:mtext>
								</mml:mrow>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:msub>
								<mml:mtext>X</mml:mtext>
								<mml:mrow>
									<mml:mtext>11</mml:mtext>
								</mml:mrow>
							</mml:msub>
							<mml:msub>
								<mml:mtext>&#x03B2;</mml:mtext>
								<mml:mrow>
									<mml:mtext>11</mml:mtext>
								</mml:mrow>
							</mml:msub>
							<mml:mo>+</mml:mo>
							<mml:mtext>&#x03B5;</mml:mtext>
						</mml:mrow>
					</mml:math>
				</disp-formula>
				<p>O 
					<xref ref-type="table" rid="ch2">Quadro 1</xref> sumariza como as vari&#x00E1;veis foram mensuradas.
				</p>
				<table-wrap id="ch2">
					<label>Quadro 1</label>
					<caption>
						<title>N&#x00ED;vel de mensura&#x00E7;&#x00E3;o das vari&#x00E1;veis</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="50%">
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Vari&#x00E1;veis</th>
								<th align="left" valign="top">Descri&#x00E7;&#x00E3;o</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">Sexo (Controle)</td>
								<td align="left" valign="top">
									<italic>Dummy:</italic> Feminino (0); Masculino (1)
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">Idade (Controle)</td>
								<td align="left" valign="top">Cont&#x00ED;nua: idade na elei&#x00E7;&#x00E3;o.</td>
							</tr>
							<tr>
								<td align="left" valign="top">Escolaridade (Controle)</td>
								<td align="left" valign="top">Categ&#x00F3;rica ordinal: L&#x00EA; e escreve (0); Ensino Fundamental incompleto (1); Ensino Fundamental completo (2); Ensino M&#x00E9;dio incompleto (3); Ensino M&#x00E9;dio completo (4); Superior incompleto (5); Ensino Superior (6).</td>
							</tr>
							<tr>
								<td align="left" valign="top">Pobreza (Controle)</td>
								<td align="left" valign="top">Cont&#x00ED;nua: percentual de pessoas pobres no estado.</td>
							</tr>
							<tr>
								<td align="left" valign="top">Ideologia (Controle)</td>
								<td align="left" valign="top">Categ&#x00F3;rica: Esquerda (0); Centro (1); Direita (2).</td>
							</tr>
							<tr>
								<td align="left" valign="top">Aumento Voto 2006 (Controle)</td>
								<td align="left" valign="top">
									<italic>Dummy</italic>: Aumentou (1); Diminuiu (0).
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">Migrante (Controle)</td>
								<td align="left" valign="top">
									<italic>Dummy</italic>: Migrou de partido (1); N&#x00E3;o migrou (0).
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<italic>Pork</italic> (Controle)
								</td>
								<td align="left" valign="top">Cont&#x00ED;nua: taxa de sucesso de execu&#x00E7;&#x00E3;o das emendas parlamentares.</td>
							</tr>
							<tr>
								<td align="left" valign="top">Cadeiras por Estado (Controle)</td>
								<td align="left" valign="top">Cont&#x00ED;nua: n&#x00FA;mero de cadeiras de cada estado na C&#x00E2;mara dos Deputados.</td>
							</tr>
							<tr>
								<td align="left" valign="top">Despesa (Controle)</td>
								<td align="left" valign="top">Cont&#x00ED;nua: despesa de campanha</td>
							</tr>
							<tr>
								<td align="left" valign="top">Esc&#x00E2;ndalo (VI)</td>
								<td align="left" valign="top">
									<italic>Dummy:</italic> Envolvido em esc&#x00E2;ndalo (1); N&#x00E3;o envolvido (0).
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">Reelei&#x00E7;&#x00E3;o (VD)</td>
								<td align="left" valign="top">
									<italic>Dummy:</italic> Reeleito (1); N&#x00E3;o Reeleito (0).
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria a partir de Castro e Nunes (2014, p. 38-40).</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>Testaremos tr&#x00EA;s hip&#x00F3;teses:</p>
				<p>
					<italic>H</italic>
					<sub>
						<italic>1</italic>
					</sub>: estar envolvido em esc&#x00E2;ndalo de corrup&#x00E7;&#x00E3;o reduz a probabilidade de reelei&#x00E7;&#x00E3;o;
				</p>
				<p>
					<italic>H</italic>
					<sub>
						<italic>2</italic>
					</sub>: quanto maior a despesa de campanha, maior a probabilidade de reelei&#x00E7;&#x00E3;o;
				</p>
				<p>
					<italic>H</italic>
					<sub>
						<italic>3</italic>
					</sub>: quanto maior a execu&#x00E7;&#x00E3;o de emendas, maior a probabilidade de reelei&#x00E7;&#x00E3;o.
				</p>
			</sec>
			<sec>
				<title>V. Resultados</title>
				<p>O primeiro passo &#x00E9; analisar a distribui&#x00E7;&#x00E3;o da vari&#x00E1;vel dependente. A 
					<xref ref-type="table" rid="t12">Tabela 3</xref> sumariza essas informa&#x00E7;&#x00F5;es.
				</p>
				<table-wrap id="t12">
					<label>Tabela 3</label>
					<caption>
						<title>Distribui&#x00E7;&#x00E3;o de frequ&#x00EA;ncia da vari&#x00E1;vel dependente (reeleito)</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="33%">
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Reeleito</th>
								<th align="center" valign="top">N</th>
								<th align="center" valign="top">%</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">Sim</td>
								<td align="center" valign="top">273</td>
								<td align="center" valign="top">60,53</td>
							</tr>
							<tr>
								<td align="left" valign="top">N&#x00E3;o</td>
								<td align="center" valign="top">178</td>
								<td align="center" valign="top">39,47</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">
									<bold>451</bold>
								</td>
								<td align="center" valign="top">
									<bold>100,0</bold>
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>Existem informa&#x00E7;&#x00F5;es para 451 casos. Desse total, 60,53% dos deputados federais foram reconduzidos em 2006, o que significa 273 ocorr&#x00EA;ncias
					<xref ref-type="fn" rid="fn56">
						<sup>23</sup>
					</xref>. Dizemos ent&#x00E3;o que a probabilidade de reelei&#x00E7;&#x00E3;o &#x00E9; de 0,605. Por sua vez, a chance de ser reeleito pode ser calculada pela divis&#x00E3;o entre as probabilidades (sim/n&#x00E3;o), no caso, 0,605/0,395 = 1,53. A 
					<xref ref-type="table" rid="t13">Tabela 4</xref> ilustra essas informa&#x00E7;&#x00F5;es.
				</p>
				<table-wrap id="t13">
					<label>Tabela 4</label>
					<caption>
						<title>Taxa comparativa de reelei&#x00E7;&#x00E3;o (envolvidos x n&#x00E3;o envolvidos) (%)</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="25%">
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">Envolvido em esc&#x00E2;ndalo</th>
								<th align="center" valign="top" colspan="2" style="border-bottom: thin solid;">Reeleito</th>
								<th align="center" valign="top">Total</th>
							</tr>
							<tr>
								<th align="left" valign="top"></th>
								<th align="center" valign="top">Sim</th>
								<th align="center" valign="top">N&#x00E3;o</th>
								<th align="center" valign="top"></th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">
									<bold>Sim 10</bold>
								</td>
								<td align="center" valign="top">(17,86)</td>
								<td align="center" valign="top">46 (82,14)</td>
								<td align="center" valign="top">
									<bold>56 (100,0)</bold>
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>N&#x00E3;o</bold>
								</td>
								<td align="center" valign="top">263 (66,58)</td>
								<td align="center" valign="top">132 (33,42)</td>
								<td align="center" valign="top">
									<bold>395 (100,0)</bold>
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">
									<bold>273 (60,53)</bold>
								</td>
								<td align="center" valign="top">
									<bold>178 (39,47)</bold>
								</td>
								<td align="center" valign="top">
									<bold>451 (100,0)</bold>
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>Ao considerar apenas os candidatos envolvidos em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o, o percentual de reelei&#x00E7;&#x00E3;o foi de 17,86%, j&#x00E1; que 10 dos 56 parlamentares conseguiram um novo mandato
					<xref ref-type="fn" rid="fn57">
						<sup>24</sup>
					</xref>. Isso quer dizer que, para esse grupo, a probabilidade de reelei&#x00E7;&#x00E3;o &#x00E9; 0,179 e a chance de reelei&#x00E7;&#x00E3;o &#x00E9; de 0,22. Para os candidatos n&#x00E3;o envolvidos em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o, a chance de ser reeleito &#x00E9; de 1,9. Fundamentalmente, em nosso exemplo de replica&#x00E7;&#x00E3;o, a regress&#x00E3;o log&#x00ED;stica consiste na an&#x00E1;lise comparativa do percentual de reelei&#x00E7;&#x00E3;o entre candidatos envolvidos em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o e os n&#x00E3;o envolvidos
					<xref ref-type="fn" rid="fn58">
						<sup>25</sup>
					</xref>.
				</p>
				<p>Em termos de ajuste geral do modelo, um dos principais testes utilizados &#x00E9; o de Hosmer e Lemeshow (2000). Esse teste &#x00E9; considerado mais robusto do que o teste de chi-quadrado comum, principalmente quando existem vari&#x00E1;veis independentes cont&#x00ED;nuas ou quando o tamanho da amostra &#x00E9; pequeno (
					<xref ref-type="bibr" rid="B22">Garson, 2011</xref>). A 
					<xref ref-type="table" rid="t14">Tabela 5</xref> sumariza as informa&#x00E7;&#x00F5;es de interesse (valor do teste, os graus de liberdade e a signific&#x00E2;ncia estat&#x00ED;stica) para o Teste de Hosmer e Lemeshow e a 
					<xref ref-type="table" rid="t15">Tabela 6</xref> apresenta as mesmas informa&#x00E7;&#x00F5;es para o teste Omnibus dos coeficientes do modelo.
				</p>
				<table-wrap id="t14">
					<label>Tabela 5</label>
					<caption>
						<title>Teste de Hosmer e Lemeshow</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="33%">
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">&#x03C7;
									<sup>2</sup>
								</th>
								<th align="center" valign="top">gl</th>
								<th align="center" valign="top">Sig</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">6,832</td>
								<td align="center" valign="top">8</td>
								<td align="center" valign="top">0,555</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
					</table-wrap-foot>
				</table-wrap>
				<table-wrap id="t15">
					<label>Tabela 6</label>
					<caption>
						<title>Teste Omnibus dos coeficientes do modelo</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="33%">
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">&#x03C7;
									<sup>2</sup>
								</th>
								<th align="center" valign="top">gl</th>
								<th align="center" valign="top">Sig</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">56.356</td>
								<td align="center" valign="top">11</td>
								<td align="center" valign="top">0.000</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>Um resultado n&#x00E3;o significativo (p &gt; 0,05) sugere que o modelo estimado com as vari&#x00E1;veis independentes &#x00E9; melhor do que o modelo nulo. O modelo estimado apresentou um chi-quadrado (&#x03C7;
					<sup>2</sup>) de 6,832 e p-valor de 0,555, sugerindo um ajuste adequado. Outra medida de ajuste comumente utilizada &#x00E9; o omnibus teste dos coeficientes (
					<italic>Omnibus test of model coefficients</italic>). &#x00C9; um teste de chi-quadrado comparando a vari&#x00E2;ncia do seu modelo com vari&#x00E1;veis independentes e o modelo nulo (apenas o intercepto).
				</p>
				<p>Diferente do teste de Hosmer e Lemeshow, um resultado significativo (p&nbsp;&lt;&nbsp;0,05) sugere um ajuste adequado. De acordo com os dados, o modelo apresentou um chi-quadrado de 56,356 (p-valor &lt; 0,001), ou seja, o modelo ajustado &#x00E9; melhor do que o modelo nulo. Assim, devemos inferir que as vari&#x00E1;veis independentes influenciam a varia&#x00E7;&#x00E3;o da vari&#x00E1;vel dependente
					<xref ref-type="fn" rid="fn59">
						<sup>26</sup>
					</xref>. N&#x00E3;o encontramos esses testes nem no artigo de Castro e Nunes (2014), nem nos scripts computacionais. A 
					<xref ref-type="table" rid="t16">Tabela 7</xref> sumariza os coeficientes estimados do modelo de regress&#x00E3;o log&#x00ED;stica na tentativa de reproduzir os resultados reportados na 
					<xref ref-type="table" rid="t15">Tabela 6</xref> de Castro e Nunes (2014).
				</p>
				<table-wrap id="t16">
					<label>Tabela 7</label>
					<caption>
						<title>Coeficientes do modelo de regress&#x00E3;o log&#x00ED;stica
							<xref ref-type="table-fn" rid="TFN5">*</xref>
						</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="14%">
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top"></th>
								<th align="center" valign="top">&#x03B2;</th>
								<th align="center" valign="top">Erro Padr&#x00E3;o</th>
								<th align="center" valign="top">Z(Wald)</th>
								<th align="center" valign="top">Sig.</th>
								<th align="center" valign="top">Exp(&#x03B2;)</th>
								<th align="center" valign="top">(exp(&#x03B2;)-1) x 100</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">(Intercepto)</td>
								<td align="center" valign="top">0,552</td>
								<td align="center" valign="top">1,568</td>
								<td align="center" valign="top">0,352</td>
								<td align="center" valign="top">0,725</td>
								<td align="center" valign="top">1,737</td>
								<td align="center" valign="top">73,734</td>
							</tr>
							<tr>
								<td align="left" valign="top">Pobreza</td>
								<td align="center" valign="top">1,171</td>
								<td align="center" valign="top">1,419</td>
								<td align="center" valign="top">0,825</td>
								<td align="center" valign="top">0,409</td>
								<td align="center" valign="top">3,224</td>
								<td align="center" valign="top">222,386</td>
							</tr>
							<tr>
								<td align="left" valign="top">Masculino</td>
								<td align="center" valign="top">-0,005</td>
								<td align="center" valign="top">0,560</td>
								<td align="center" valign="top">-0,009</td>
								<td align="center" valign="top">0,993</td>
								<td align="center" valign="top">0,995</td>
								<td align="center" valign="top">-0,484</td>
							</tr>
							<tr>
								<td align="left" valign="top">Idade</td>
								<td align="center" valign="top">-0,014</td>
								<td align="center" valign="top">0,017</td>
								<td align="center" valign="top">-0,830</td>
								<td align="center" valign="top">0,406</td>
								<td align="center" valign="top">0,986</td>
								<td align="center" valign="top">-1,409</td>
							</tr>
							<tr>
								<td align="left" valign="top">Escolaridade</td>
								<td align="center" valign="top">-0,060</td>
								<td align="center" valign="top">0,161</td>
								<td align="center" valign="top">-0,370</td>
								<td align="center" valign="top">0,712</td>
								<td align="center" valign="top">0,942</td>
								<td align="center" valign="top">-5,789</td>
							</tr>
							<tr>
								<td align="left" valign="top">Ideologia</td>
								<td align="center" valign="top">-0,125</td>
								<td align="center" valign="top">0,224</td>
								<td align="center" valign="top">-0,561</td>
								<td align="center" valign="top">0,575</td>
								<td align="center" valign="top">0,882</td>
								<td align="center" valign="top">-11,782</td>
							</tr>
							<tr>
								<td align="left" valign="top">Aumentou Votos</td>
								<td align="center" valign="top">0,908</td>
								<td align="center" valign="top">0,341</td>
								<td align="center" valign="top">2,663</td>
								<td align="center" valign="top">0,008</td>
								<td align="center" valign="top">2,480</td>
								<td align="center" valign="top">148,030</td>
							</tr>
							<tr>
								<td align="left" valign="top">Migrante</td>
								<td align="center" valign="top">0,078</td>
								<td align="center" valign="top">0,382</td>
								<td align="center" valign="top">0,205</td>
								<td align="center" valign="top">0,838</td>
								<td align="center" valign="top">1,081</td>
								<td align="center" valign="top">8,136</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Emendas Parlamentares</bold>
								</td>
								<td align="center" valign="top">
									<bold>-0,272</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,639</bold>
								</td>
								<td align="center" valign="top">
									<bold>-0,425</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,671</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,762</bold>
								</td>
								<td align="center" valign="top">
									<bold>-23,785</bold>
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">Candidato/vagas</td>
								<td align="center" valign="top">-0,005</td>
								<td align="center" valign="top">0,009</td>
								<td align="center" valign="top">-0,516</td>
								<td align="center" valign="top">0,606</td>
								<td align="center" valign="top">0,995</td>
								<td align="center" valign="top">-0,469</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Despesas de Campanha</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,000</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,000</bold>
								</td>
								<td align="center" valign="top">
									<bold>3,920</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,000</bold>
								</td>
								<td align="center" valign="top">
									<bold>1,000</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,000</bold>
								</td>
							</tr>
							<tr>
								<td align="left" valign="top">
									<bold>Esc&#x00E2;ndalo</bold>
								</td>
								<td align="center" valign="top">
									<bold>-1,677</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,528</bold>
								</td>
								<td align="center" valign="top">
									<bold>-3,176</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,001</bold>
								</td>
								<td align="center" valign="top">
									<bold>0,187</bold>
								</td>
								<td align="center" valign="top">
									<bold>-81,299</bold>
								</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
						<fn id="TFN4">
							<p>Vari&#x00E1;vel dependente: reeleito.</p>
						</fn>
						<fn id="TFN5">
							<label>
								<sup>*</sup>
							</label>
							<p>Como em qualquer modelo de regress&#x00E3;o, os coeficientes n&#x00E3;o padronizados de vari&#x00E1;veis com escalas diferentes n&#x00E3;o podem ser diretamente comparados. O STATA tem o comando (listcoef, std help) que produz coeficientes padronizados na vari&#x00E1;vel independente, dependente e em ambas. 
								<xref ref-type="bibr" rid="B48">Menard (2004)</xref> apresenta seis diferentes formas de padronizar os coeficientes em regress&#x00E3;o log&#x00ED;stica.
							</p>
						</fn>
					</table-wrap-foot>
				</table-wrap>
				<p>Assim como na regress&#x00E3;o linear, o primeiro passo &#x00E9; examinar os coeficientes estimados (&#x03B2;). Aqui o pesquisador deve observar o sinal das estimativas e comparar com a dire&#x00E7;&#x00E3;o esperada em suas hip&#x00F3;teses de trabalho. X
					<sub>11</sub> (Esc&#x00E2;ndalo) tem um efeito negativo (-1,677) sobre a probabilidade de reelei&#x00E7;&#x00E3;o. Diferente do modelo linear, o coeficiente da regress&#x00E3;o log&#x00ED;stica n&#x00E3;o tem uma interpreta&#x00E7;&#x00E3;o direta.
				</p>
				<p>Existem duas principais formas de interpretar os coeficientes: a) analisar a raz&#x00E3;o de chance e b) transformar raz&#x00E3;o de chance em percentual. Pelo primeiro crit&#x00E9;rio, devemos concluir ent&#x00E3;o que o envolvimento em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o reduz a chance de ser reeleito. Em termos percentuais, estar envolvido em corrup&#x00E7;&#x00E3;o diminui em 81,2% a probabilidade de ser reeleito, tal como esperado teoricamente pela hip&#x00F3;tese 1. Ao se considerar a despesa de campanha, o efeito foi nulo, com um Exp (&#x03B2;) = 1,000.</p>
				<p>Assim como Castro e Nunes (2014), n&#x00E3;o encontramos efeitos significativos da vari&#x00E1;vel emendas parlamentares sobre a chance de reelei&#x00E7;&#x00E3;o, levando em conta a magnitude do p-valor e o erro padr&#x00E3;o duas vezes maior do que a pr&#x00F3;pria estimativa do impacto
					<xref ref-type="fn" rid="fn60">
						<sup>27</sup>
					</xref>.
				</p>
				<p>Depois de analisar os coeficientes associados &#x00E0;s vari&#x00E1;veis de interesse, o pr&#x00F3;ximo passo &#x00E9; avaliar a qualidade do ajuste do modelo. A 
					<xref ref-type="table" rid="t17">Tabela 8</xref> sumariza algumas medidas de ajuste tipicamente reportadas em modelos estimados por m&#x00E1;xima verossimilhan&#x00E7;a
					<xref ref-type="fn" rid="fn61">
						<sup>28</sup>
					</xref>.
				</p>
				<table-wrap id="t17">
					<label>Tabela 8</label>
					<caption>
						<title>Medidas de ajuste do modelo
							<xref ref-type="table-fn" rid="TFN6">*</xref>
						</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="20%">
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top">-2log likelihood null</th>
								<th align="center" valign="top">-2log likelihood</th>
								<th align="center" valign="top">Cox &amp; Snell R
									<sup>2</sup>
								</th>
								<th align="center" valign="top">Nagelkerke R
									<sup>2</sup>
								</th>
								<th align="center" valign="top">BIC</th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">3,057,559</td>
								<td align="center" valign="top">237,4225</td>
								<td align="center" valign="top">0.229</td>
								<td align="center" valign="top">0.308</td>
								<td align="center" valign="top">301,891</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
						<fn id="TFN6">
							<label>
								<sup>*</sup>
							</label>
							<p>A estat&#x00ED;stica - 2 log likelihood (-2LL) &#x00E9; uma medida de ajuste. Quanto menor, melhor &#x00E9; o ajuste. O pesquisador pode utiliz&#x00E1;-la para comparar os ajustes de diferentes modelos (incluindo e retirando vari&#x00E1;veis independentes, mas preservando a mesma vari&#x00E1;vel dependente).</p>
						</fn>
					</table-wrap-foot>
				</table-wrap>
				<p>&#x00C9; comum aparecer nas sa&#x00ED;das dos diferentes pacotes estat&#x00ED;sticos o n&#x00FA;mero de itera&#x00E7;&#x00F5;es utilizadas pelo computador para estimar o modelo. Ao informar que o modelo convergiu ap&#x00F3;s a 5 itera&#x00E7;&#x00E3;o, isso quer dizer que os coeficientes foram estimados via m&#x00E1;xima verossimilhan&#x00E7;a. Em geral, quanto mais r&#x00E1;pido o modelo convergir (menos itera&#x00E7;&#x00F5;es), melhor. Se o modelo n&#x00E3;o convergir, os coeficientes n&#x00E3;o s&#x00E3;o confi&#x00E1;veis. Um dos principais fatores que explicam a n&#x00E3;o convers&#x00E3;o do modelo &#x00E9; a insufici&#x00EA;ncia de casos em rela&#x00E7;&#x00E3;o ao n&#x00FA;mero de vari&#x00E1;veis independentes inclu&#x00ED;das no modelo.</p>
				<p>De acordo com 
					<xref ref-type="bibr" rid="B47">Menard (2002)</xref>, o 
					<italic>log likelihood</italic> &#x00E9; uma medida de sele&#x00E7;&#x00E3;o de par&#x00E2;metros no modelo de regress&#x00E3;o log&#x00ED;stica. No entanto, a maior parte dos pacotes estat&#x00ED;sticos reporta o 
					<italic>-2 log likelihood</italic> (-2LL) e sua interpreta&#x00E7;&#x00E3;o &#x00E9; a seguinte: quanto maior, pior &#x00E9; a capacidade explicativa/preditiva do modelo. Intuitivamente, ele pode ser interpretado como uma medida do erro ao tentar utilizar um determinado conjunto de vari&#x00E1;veis independentes (modelo) para explicar a varia&#x00E7;&#x00E3;o da vari&#x00E1;vel dependente. O pesquisador pode solicitar a 
					<italic>iteration history</italic> da estima&#x00E7;&#x00E3;o. O procedimento vai produzir o 
					<italic>-2 log likelihood</italic> do modelo nulo e do modelo ajustado. A diferen&#x00E7;a entre elas &#x00E9; medida em termos de chi-quadrado. Como ele &#x00E9; uma medida de erro, quanto maior for o chi-quadrado, maior &#x00E9; a redu&#x00E7;&#x00E3;o do erro do modelo ajustado (com as vari&#x00E1;veis independentes) em rela&#x00E7;&#x00E3;o ao modelo nulo.
				</p>
				<p>A 
					<xref ref-type="table" rid="t17">Tabela 8</xref> apresenta o valor do -2LL para facilitar a compara&#x00E7;&#x00E3;o entre os modelos. No modelo nulo, o -2LL era de 3.057.559, e o modelo com as vari&#x00E1;veis independentes foi de 237.4225. Nesse caso, observamos uma redu&#x00E7;&#x00E3;o consider&#x00E1;vel. Isto significa que o modelo com vari&#x00E1;veis independentes tem um ajuste superior ao modelo nulo. Do mesmo modo, o BIC (
					<italic>Bayesian Information Criterion</italic>) &#x00E9; mais uma medida baseada em m&#x00E1;xima verossimilhan&#x00E7;a. Quanto menor, melhor. O modelo testado apresentou um BIC de 301,891, enquanto o do modelo nulo foi 3.066,105. Podemos extrapolar isso e comparar diversos modelos e n&#x00E3;o apenas o nulo.
				</p>
				<p>Diferente do modelo linear, a regress&#x00E3;o log&#x00ED;stica n&#x00E3;o tem uma medida s&#x00ED;ntese da varia&#x00E7;&#x00E3;o na vari&#x00E1;vel dependente explicada pelo modelo, tal como o coeficiente de determina&#x00E7;&#x00E3;o
					<xref ref-type="fn" rid="fn62">
						<sup>29</sup>
					</xref>. No entanto, algumas medidas foram desenvolvidas no sentido de orientar o pesquisador em rela&#x00E7;&#x00E3;o ao poder explicativo/preditivo do modelo
					<xref ref-type="fn" rid="fn63">
						<sup>30</sup>
					</xref>. As mais comumente empregadas s&#x00E3;o o pseudo R
					<sup>2</sup> de Cox e Snell e o pseudo R
					<sup>2</sup> de Nagelkerke
					<xref ref-type="fn" rid="fn64">
						<sup>31</sup>
					</xref>. Para 
					<xref ref-type="bibr" rid="B47">Menard (2002)</xref>,
				</p>
				<disp-quote>
					<p>&#x201C;
						<italic>R</italic>
						<sub>
							<italic>i</italic>
						</sub>
						<sup>2</sup> &#x00E9; uma redu&#x00E7;&#x00E3;o proporcional em -2LL ou uma redu&#x00E7;&#x00E3;o proporcional do valor absoluto do log-likelihood, onde a quantidade sendo minimizada para selecionar os par&#x00E2;metros do modelo &#x00E9; tomada como uma medida da varia&#x00E7;&#x00E3;o&#x201D; (
						<xref ref-type="bibr" rid="B46">Menard, 2002</xref>, p. 25).
					</p>
				</disp-quote>
				<p>Para os prop&#x00F3;sitos deste artigo adotamos a seguinte interpreta&#x00E7;&#x00E3;o: quanto mais pr&#x00F3;ximo de zero, menor &#x00E9; a diferen&#x00E7;a entre o modelo nulo (sem nenhuma vari&#x00E1;vel independente) e o modelo estimado. Quanto mais pr&#x00F3;ximo de um, maior &#x00E9; a diferen&#x00E7;a entre o modelo nulo e o modelo proposto pelo pesquisador. No limite, um pseudo R
					<sup>2</sup> de zero indica que as vari&#x00E1;veis independentes inclu&#x00ED;das n&#x00E3;o ajudam a explicar a varia&#x00E7;&#x00E3;o da vari&#x00E1;vel dependente. Um pseudo R
					<sup>2</sup> de 1 sugere que as vari&#x00E1;veis explicam/predizem perfeitamente a varia&#x00E7;&#x00E3;o de Y. Lembrando que devemos ser menos exigentes com o modelo log&#x00ED;stico do que com o modelo linear em termos de vari&#x00E2;ncia explicada pelo R
					<sup>2</sup>.
				</p>
				<p>Por fim, o pesquisador deve analisar a tabela de classifica&#x00E7;&#x00E3;o (
					<italic>classification table</italic>). Essa sa&#x00ED;da &#x00E9; particularmente interessante pois fornece uma medida da capacidade preditiva do modelo. A 
					<xref ref-type="table" rid="t18">Tabela 9</xref> ilustra as informa&#x00E7;&#x00F5;es de interesse.
				</p>
				<table-wrap id="t18">
					<label>Tabela 9</label>
					<caption>
						<title>Tabela de classifica&#x00E7;&#x00E3;o</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup width="20%">
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
							<tr>
								<th align="left" valign="top"></th>
								<th align="center" valign="top"></th>
								<th align="center" valign="top" colspan="2" style="border-bottom: thin solid;">Predito</th>
								<th align="center" valign="top">Total</th>
							</tr>
							<tr>
								<th align="left" valign="top"></th>
								<th align="center" valign="top"></th>
								<th align="center" valign="top">N&#x00E3;o reeleito</th>
								<th align="center" valign="top">Reeleito</th>
								<th align="center" valign="top"></th>
							</tr>
						</thead>
						<tbody style="border-bottom: thin solid; border-color: #000000">
							<tr>
								<td align="left" valign="top">
									<bold>Real</bold>
								</td>
								<td align="center" valign="top">
									<bold>N&#x00E3;o reeleito</bold>
								</td>
								<td align="center" valign="top">23,50</td>
								<td align="center" valign="top">17,51</td>
								<td align="center" valign="top">41,01</td>
							</tr>
							<tr>
								<td align="left" valign="top"></td>
								<td align="center" valign="top">
									<bold>Reeleito</bold>
								</td>
								<td align="center" valign="top">10,60</td>
								<td align="center" valign="top">48,39</td>
								<td align="center" valign="top">58,99</td>
							</tr>
							<tr>
								<td align="left" valign="top"></td>
								<td align="center" valign="top">
									<bold>Total</bold>
								</td>
								<td align="center" valign="top">34,10</td>
								<td align="center" valign="top">65,90</td>
								<td align="center" valign="top">100,00</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<attrib>Fonte: elabora&#x00E7;&#x00E3;o pr&#x00F3;pria.</attrib>
					</table-wrap-foot>
				</table-wrap>
				<p>A tabela de classifica&#x00E7;&#x00E3;o &#x00E9; chamada, frequentemente, de tabela de confus&#x00E3;o. Para 
					<xref ref-type="bibr" rid="B22">Garson (2011)</xref>,
				</p>
				<disp-quote>
					<p>&#x201C;apesar da classifica&#x00E7;&#x00E3;o correta como medida do ajuste do modelo seja prefer&#x00ED;vel &#x00E0;s medidas de pseudo-R
						<sup>2</sup>, eles t&#x00EA;m algumas limita&#x00E7;&#x00F5;es severas para esta finalidade. Tabelas de Classifica&#x00E7;&#x00E3;o n&#x00E3;o devem ser usadas exclusivamente como medidas de ajuste porque eles ignoram probabilidades preditas reais e em vez disso usam previs&#x00F5;es dicot&#x00F4;micas baseadas em um ponto de corte (ex.: 0,50). Por exemplo, em regress&#x00E3;o log&#x00ED;stica bin&#x00E1;ria, prevendo uma dependente de 0 ou 1, a Tabela Classifica&#x00E7;&#x00E3;o n&#x00E3;o revela qu&#x00E3;o perto de 1 foram as previs&#x00F5;es corretas, nem qu&#x00E3;o perto de 0 foram os erros. Um modelo no qual as predi&#x00E7;&#x00F5;es, corretas ou n&#x00E3;o, est&#x00E3;o muito pr&#x00F3;ximas do ponto de corte 0,5, n&#x00E3;o tem um ajuste t&#x00E3;o bom como um modelo onde os clusters das pontua&#x00E7;&#x00F5;es preditas estejam pr&#x00F3;ximos de 1 ou 0. Al&#x00E9;m disso, porque a taxa de acerto pode variar consideravelmente para o mesmo modelo de log&#x00ED;stica em diferentes amostras, o uso da Tabela Classifica&#x00E7;&#x00E3;o para comparar diferentes amostras n&#x00E3;o &#x00E9; recomendado&#x201D; (
						<xref ref-type="bibr" rid="B22">Garson, 2011</xref>, p. 173).
					</p>
				</disp-quote>
				<p>Nossa matriz de classifica&#x00E7;&#x00E3;o utiliza o padr&#x00E3;o convencional de 50% para alocar os casos como 1 (se a probabilidade predita foi maior que 0,5) ou 0 (menor que 0,5). Podemos avaliar essa tabela utilizando tr&#x00EA;s conceitos: acur&#x00E1;cia, sensibilidade e especificidade. A acur&#x00E1;cia do modelo &#x00E9; a propor&#x00E7;&#x00E3;o de casos verdadeiros positivos e verdadeiros negativos. De acordo com a 
					<xref ref-type="table" rid="t9">Tabela 9</xref>, a acur&#x00E1;cia do nosso modelo foi de 71,89% (23,50% + 48,29%). Contudo, nem sempre a acur&#x00E1;cia do modelo &#x00E9; o mais importante. Em determinados casos, importa maximizar a taxa de verdadeiros positivos, ou verdadeiros negativos.
				</p>
				<p>Passemos &#x00E0; sensibilidade. Ela diz respeito ao percentual de casos que tem a caracter&#x00ED;stica de interesse (foi reeleito) que foram corretamente preditos pelo modelo (verdadeiros positivos / (falsos positivos + verdadeiros positivos). No nosso exemplo, 48,39% dos candidatos reeleitos foram corretamente classificados de um total de 58,99% que realmente foram reeleitos. Isto nos d&#x00E1; uma sensibilidade de 82,03% (48,39%/58,99%). J&#x00E1; a especificidade do modelo diz respeito ao percentual de casos que n&#x00E3;o tem a caracter&#x00ED;stica de interesse (n&#x00E3;o foram reeleitos) que foram corretamente classificados pelo modelo, isto &#x00E9; (verdadeiros negativos / (falsos negativos + verdadeiros negativos)). Como pode ser observado, 23,50% dos candidatos n&#x00E3;o-reeleitos foram corretamente identificados em um total de 41,01% de n&#x00E3;o reeleitos. Isto nos d&#x00E1; uma especificidade de 57,30% (23,50%/41,01%). Existe um 
					<italic>tradeoff</italic> entre sensibilidade e a especificidade. Ao aumentar uma, a outra diminui. Embora &#x00E0;s vezes a sensibilidade do modelo seja mais importante (prever que ter&#x00E1; uma doen&#x00E7;a, j&#x00E1; que voc&#x00EA; poder&#x00E1; trat&#x00E1;-la), outras vezes o melhor seria aumentar a especificidade (impedir que pol&#x00ED;ticos corruptos sejam reeleitos).
				</p>
			</sec>
			<sec>
				<title>VI. Conclus&#x00E3;o</title>
				<p>Esperamos ajudar estudantes e professores a melhor compreenderem o funcionamento da regress&#x00E3;o log&#x00ED;stica. A aus&#x00EA;ncia de cursos de c&#x00E1;lculo, &#x00E1;lgebra linear e matricial e estat&#x00ED;stica avan&#x00E7;ada limita a capacidade de compreender t&#x00E9;cnicas mais avan&#x00E7;adas de an&#x00E1;lise de dados. Por esse motivo, nossa abordagem se concentrou na exposi&#x00E7;&#x00E3;o intuitiva dos resultados. Acreditamos tamb&#x00E9;m que entender a l&#x00F3;gica intuitiva da regress&#x00E3;o log&#x00ED;stica &#x00E9; o primeiro passo para melhor compreender os diferentes procedimentos existentes para lidar com dados categ&#x00F3;ricos. O avan&#x00E7;o computacional permite que pesquisadores com menor treinamento espec&#x00ED;fico em Matem&#x00E1;tica e Estat&#x00ED;stica possam se beneficiar das vantagens associadas &#x00E0;s diferentes t&#x00E9;cnicas multivariadas. Dado que muitas vari&#x00E1;veis em Ci&#x00EA;ncia Pol&#x00ED;tica s&#x00E3;o categ&#x00F3;ricas, os benef&#x00ED;cios anal&#x00ED;ticos associados &#x00E0; correta aplica&#x00E7;&#x00E3;o e interpreta&#x00E7;&#x00E3;o do modelo log&#x00ED;stico s&#x00E3;o evidentes. Com esse artigo, esperamos difundir a utiliza&#x00E7;&#x00E3;o da regress&#x00E3;o log&#x00ED;stica.</p>
				<p>E como melhorar a qualidade do treinamento metodol&#x00F3;gico e t&#x00E9;cnico ofertado aos alunos da gradua&#x00E7;&#x00E3;o e p&#x00F3;s-gradua&#x00E7;&#x00E3;o em Ci&#x00EA;ncia Pol&#x00ED;tica no Brasil? Recomendamos o seguinte: (1) incorporar a replica&#x00E7;&#x00E3;o como ferramenta pedag&#x00F3;gica em cursos de an&#x00E1;lise de dados; (2) incluir cursos obrigat&#x00F3;rios de matem&#x00E1;tica, c&#x00E1;lculo, probabilidade e estat&#x00ED;stica no componente curricular dos cursos de gradua&#x00E7;&#x00E3;o e p&#x00F3;s-gradua&#x00E7;&#x00E3;o. Al&#x00E9;m disso, os alunos devem receber treinamento espec&#x00ED;fico em alguma linguagem de programa&#x00E7;&#x00E3;o; (3) realizar exerc&#x00ED;cios pr&#x00E1;ticos envolvendo an&#x00E1;lise de dados com problemas tipicamente de Ci&#x00EA;ncia Pol&#x00ED;tica. A &#x00EA;nfase em problemas abstratos reduz o interesse dos alunos pelo tema; (4) incentivar a participa&#x00E7;&#x00E3;o dos discentes em cursos de ver&#x00E3;o como o MQ-UFMG e IPSA-USP; (5) promover cursos de epistemologia e filosofia da ci&#x00EA;ncia. A defini&#x00E7;&#x00E3;o dos m&#x00E9;todos e t&#x00E9;cnicas de pesquisa dependem da vis&#x00E3;o epistemol&#x00F3;gica do que &#x00E9; o conhecimento cient&#x00ED;fico e de como ele deve ser implementado; (6) difundir a leitura cr&#x00ED;tica de artigos que utilizam t&#x00E9;cnicas avan&#x00E7;adas de an&#x00E1;lise de dados; (7) acompanhar a produ&#x00E7;&#x00E3;o acad&#x00EA;mica de revistas com &#x00EA;nfase metodol&#x00F3;gica, como, por exemplo, a 
					<italic>Political Analysis</italic> e a 
					<italic>Political Science Research and Methods;</italic> (8) fomentar a publica&#x00E7;&#x00E3;o de artigos metodol&#x00F3;gicos em peri&#x00F3;dicos nacionais; (9) favorecer a cria&#x00E7;&#x00E3;o de grupos de pesquisa e mesas redondas sobre metodologia e t&#x00E9;cnicas em an&#x00E1;lise de dados em congressos profissionais; e (10) financiar projetos de pesquisa especialmente voltados para aprofundar o 
					<italic>status</italic> do conhecimento sobre a principal caracter&#x00ED;stica da ci&#x00EA;ncia: o m&#x00E9;todo.
				</p>
			</sec>
		</body>
		<back>
			<fn-group>
				<fn fn-type="other" id="fn34">
					<label>1</label>
					<p>Materiais de replica&#x00E7;&#x00E3;o dispon&#x00ED;veis em: &lt;https://osf.io/nv4ae/&gt;. Este artigo tamb&#x00E9;m se beneficiou dos coment&#x00E1;rios do professor Jairo Nicolau e das sugest&#x00F5;es recebidas pelos pareceristas an&#x00F4;nimos da 
						<italic>Revista de Sociologia e Pol&#x00ED;tica</italic>. Agradecemos ainda ao 
						<italic>Berkeley Initiative for Transparency in the Social Sciences</italic> e ao 
						<italic>Teaching Integrity in Empirical Research</italic>.
					</p>
				</fn>
				<fn fn-type="other" id="fn35">
					<label>2</label>
					<p>Para uma breve retrospectiva do mensal&#x00E3;o, ver 
						<xref ref-type="bibr" rid="B69">O julgamento do Mensal&#x00E3;o (2012)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn36">
					<label>3</label>
					<p>Para uma apresenta&#x00E7;&#x00E3;o do esc&#x00E2;ndalo dos sanguessugas, ver 
						<xref ref-type="bibr" rid="B70">Entenda o Esc&#x00E2;ndalo dos sanguessugas (2006)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn37">
					<label>4</label>
					<p>Veja o curso sobre regress&#x00E3;o log&#x00ED;stica oferecido pelo 
						<italic>Coursera</italic> (
						<ext-link ext-link-type="uri" xlink:href="https://www.coursera.org/course/logisticregression">https://www.coursera.org/course/logisticregression</ext-link>). Sugerimos tamb&#x00E9;m o curso de an&#x00E1;lise de dados categ&#x00F3;ricos ofertado pelo Programa de Treinamento Intensivo em Metodologia Quantitativa da Universidade Federal de Minas Gerais (MQ &#x2013; UFMG).
					</p>
				</fn>
				<fn fn-type="other" id="fn38">
					<label>5</label>
					<p>N&#x00E3;o discutiremos os fundamentos matem&#x00E1;ticos da regress&#x00E3;o log&#x00ED;stica. Para leitores interessados no assunto sugerimos ver 
						<xref ref-type="bibr" rid="B44">Long (1997)</xref> e 
						<xref ref-type="bibr" rid="B53">Pampel (2000)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn39">
					<label>6</label>
					<p>Existem extens&#x00F5;es do modelo log&#x00ED;stico que permitem modelar a varia&#x00E7;&#x00E3;o de vari&#x00E1;veis ordinais (regress&#x00E3;o log&#x00ED;stica ordinal) e policot&#x00F4;micas (regress&#x00E3;o log&#x00ED;stica multinomial).</p>
				</fn>
				<fn fn-type="other" id="fn40">
					<label>7</label>
					<p>
						<xref ref-type="bibr" rid="B49">Nelder e Wedderburn (1972)</xref> demonstraram que &#x00E9; poss&#x00ED;vel utilizar o mesmo algoritmo para estimar modelos da fam&#x00ED;lia da distribui&#x00E7;&#x00E3;o exponencial, tais como Log&#x00ED;stico, Probit, Poisson, Gama e Normal Inversa. N&#x00E3;o se preocupe com as f&#x00F3;rmulas desses modelos. O importante &#x00E9; compreender para que serve cada um deles, quando devem ser utilizados e como os coeficientes devem ser interpretados.
					</p>
				</fn>
				<fn fn-type="other" id="fn41">
					<label>8</label>
					<p>
						<xref ref-type="bibr" rid="B27">Hair 
							<italic>et al.</italic> (2009)
						</xref> afirmam que homocedasticidade refere-se ao pressuposto de que a vari&#x00E1;vel dependente exibe n&#x00ED;veis iguais de vari&#x00E2;ncia em toda a gama de vari&#x00E1;vel preditora (Hair 
						<italic>et al.</italic>, 2009, p. 83).
					</p>
				</fn>
				<fn fn-type="other" id="fn42">
					<label>9</label>
					<p>Para 
						<xref ref-type="bibr" rid="B27">Hair 
							<italic>et al.</italic> (2009)
						</xref>, um pressuposto impl&#x00ED;cito de todas as t&#x00E9;cnicas de an&#x00E1;lise multivariada com base em medidas correlacionais de associa&#x00E7;&#x00E3;o, incluindo regress&#x00E3;o linear m&#x00FA;ltipla e regress&#x00E3;o log&#x00ED;stica, &#x00E9; a linearidade (Hair 
						<italic>et al.</italic>, 2009, p. 85).
					</p>
				</fn>
				<fn fn-type="other" id="fn43">
					<label>10</label>
					<p>Um estimador &#x00E9; 
						<italic>Best Linear Unbiased Estimator</italic> quando as seguintes propriedades s&#x00E3;o satisfeitas. Melhor significa eficiente, que produz a menor vari&#x00E2;ncia, linear refere-se ao tipo de rela&#x00E7;&#x00E3;o esperada entre os par&#x00E2;metros e n&#x00E3;o-viesamento diz respeito &#x00E0; distribui&#x00E7;&#x00E3;o amostral do estimador. Um estimador viesado &#x00E9; aquele que sistematicamente superestima ou subestima o valor do par&#x00E2;metro populacional.
					</p>
				</fn>
				<fn fn-type="other" id="fn44">
					<label>11</label>
					<p>Os dados est&#x00E3;o dispon&#x00ED;veis em: &lt;http://www.ats.ucla.edu/stat/stata/examples/alr2/alr2stata1.htm&gt;.</p>
				</fn>
				<fn fn-type="other" id="fn45">
					<label>12</label>
					<p>A regress&#x00E3;o log&#x00ED;stica tamb&#x00E9;m acomoda vari&#x00E1;veis com mais de duas categorias. Quando n&#x00E3;o existe hierarquia entre as categorias, como por exemplo na distribui&#x00E7;&#x00E3;o do estado civil, devemos utilizar a regress&#x00E3;o multinomial. Por sua vez, a regress&#x00E3;o log&#x00ED;stica ordinal &#x00E9; ideal para modelar a distribui&#x00E7;&#x00E3;o de vari&#x00E1;veis ordinais, ou seja, quando existe uma estrutura de intensidade entre as categorias.</p>
				</fn>
				<fn fn-type="other" id="fn46">
					<label>13</label>
					<p>A categoriza&#x00E7;&#x00E3;o de vari&#x00E1;veis tende a produzir estimativas viesadas e ineficientes (
						<xref ref-type="bibr" rid="B67">Taylor &amp; Yu, 2002</xref>). Por esse motivo, enfatizamos o termo &#x201C;originalmente dicot&#x00F4;micas&#x201D; e recomendamos nunca reduzir o n&#x00ED;vel de mensura&#x00E7;&#x00E3;o de vari&#x00E1;veis cont&#x00ED;nuas, discretas ou ordinais com o objetivo de aplicar modelos de regress&#x00E3;o log&#x00ED;stica. Ainda na d&#x00FA;vida? Veja 
						<xref ref-type="bibr" rid="B12">Fernandes 
							<italic>et al.</italic> (2019)
						</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn47">
					<label>14</label>
					<p>Quando a correla&#x00E7;&#x00E3;o &#x00E9; muito alta (alguns usam a regra de ouro de r &#x2265; 0,90), o erro padr&#x00E3;o dos coeficientes &#x00E9; grande, dificultando avaliar a import&#x00E2;ncia relativa das vari&#x00E1;veis explicativas. Para entender melhor os problemas que altos n&#x00ED;veis de correla&#x00E7;&#x00E3;o entre as vari&#x00E1;veis independentes podem gerar, ver Figueiredo, Silva e Domingos (2015).</p>
				</fn>
				<fn fn-type="other" id="fn48">
					<label>15</label>
					<p>Para uma introdu&#x00E7;&#x00E3;o sobre como detectar 
						<italic>outliers</italic>, ver 
						<xref ref-type="bibr" rid="B14">Figueiredo Filho e Silva (2016)</xref>, dispon&#x00ED;vel em: &lt;https://cienciapolitica.org.br/system/files/documentos/eventos/2017/04/outlier-que-pertuba-seu-sono-como-identificar-e-manejar.pdf&gt;.
					</p>
				</fn>
				<fn fn-type="other" id="fn49">
					<label>16</label>
					<p>O pesquisador pode disponibilizar os dados no 
						<italic>Dataverse</italic> da Universidade de Harvard. O 
						<italic>Open Science Framework</italic> tamb&#x00E9;m pode ser utilizado para disponibiliza&#x00E7;&#x00E3;o de dados em projetos mais amplos. No Brasil, sugerimos o Cons&#x00F3;rcio de Informa&#x00E7;&#x00F5;es Sociais (CIS).
					</p>
				</fn>
				<fn fn-type="other" id="fn50">
					<label>17</label>
					<p>No modelo linear, o coeficiente de regress&#x00E3;o &#x00E9; interpretado como a varia&#x00E7;&#x00E3;o observada na vari&#x00E1;vel dependente (Y) quando a vari&#x00E1;vel independente (X) aumenta em uma unidade. Na regress&#x00E3;o log&#x00ED;stica, o coeficiente indica a varia&#x00E7;&#x00E3;o no logaritmo da chance da vari&#x00E1;vel dependente ao se elevar a vari&#x00E1;vel explicativa em uma unidade.</p>
				</fn>
				<fn fn-type="other" id="fn51">
					<label>18</label>
					<p>Leitores pouco familiarizados com o conceito de chance devem consultar o 
						<xref ref-type="app" rid="app2">Ap&#x00EA;ndice</xref> metodol&#x00F3;gico deste artigo antes de continuar a leitura. Para um tratamento mais detalhado, ver 
						<xref ref-type="bibr" rid="B29">Hilbe (2009)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn52">
					<label>19</label>
					<p>Na hora de interpretar a signific&#x00E2;ncia estat&#x00ED;stica do intervalo de confian&#x00E7;a do coeficiente de regress&#x00E3;o da raz&#x00E3;o de chance devemos observar se o intervalo inclui o valor um (1). Em caso afirmativo, estamos diante de um resultado n&#x00E3;o significativo. Por exemplo, em um intervalo de confian&#x00E7;a em que o coeficiente varia entre 0,8 e 1,6, n&#x00E3;o &#x00E9; poss&#x00ED;vel rejeitar a hip&#x00F3;tese nula.</p>
				</fn>
				<fn fn-type="other" id="fn53">
					<label>20</label>
					<p>Seguindo as melhores pr&#x00E1;ticas cient&#x00ED;ficas, os autores disponibilizaram os dados e 
						<italic>scripts</italic> no seguinte endere&#x00E7;o eletr&#x00F4;nico: &lt;http://thedata.harvard.edu/dvn/dv/felipenunes&gt;.
					</p>
				</fn>
				<fn fn-type="other" id="fn54">
					<label>21</label>
					<p>A principal vantagem de utilizar a codifica&#x00E7;&#x00E3;o 0/1 &#x00E9; que a m&#x00E9;dia da distribui&#x00E7;&#x00E3;o ser&#x00E1; igual &#x00E0; propor&#x00E7;&#x00E3;o de casos 1 na amostra. Em uma distribui&#x00E7;&#x00E3;o com 100 ocorr&#x00EA;ncias, em que 25 casos foram codificados como 1, a m&#x00E9;dia ser&#x00E1; 0,25, o que representa exatamente a propor&#x00E7;&#x00E3;o de eventos codificados como 1.</p>
				</fn>
				<fn fn-type="other" id="fn55">
					<label>22</label>
					<p>Castro e Nunes (2014) estimaram o modelo de regress&#x00E3;o a partir da fun&#x00E7;&#x00E3;o de liga&#x00E7;&#x00E3;o probit. A fun&#x00E7;&#x00E3;o logit &#x00E9; mais adequada para trabalhar com amostras pequenas (n &lt; 20) uma vez que apresenta maior taxa de converg&#x00EA;ncia. Em amostras grandes, por outro lado, n&#x00E3;o existem diferen&#x00E7;as significativas entre essas fun&#x00E7;&#x00F5;es de liga&#x00E7;&#x00E3;o. Para mais informa&#x00E7;&#x00F5;es sobre o assunto, ver 
						<xref ref-type="bibr" rid="B20">Freitas (2013)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn56">
					<label>23</label>
					<p>O pesquisador deve se certificar de que nenhuma categoria tenha uma distribui&#x00E7;&#x00E3;o inferior a 5%. Isso porque enquadra o fen&#x00F4;meno como evento raro, sendo necess&#x00E1;rio aplicar corre&#x00E7;&#x00F5;es espec&#x00ED;ficas para lidar com essa situa&#x00E7;&#x00E3;o. Para os leitores interessados ver 
						<xref ref-type="bibr" rid="B39">King e Zeng (2001)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn57">
					<label>24</label>
					<p>Esses achados divergem residualmente das informa&#x00E7;&#x00F5;es reportadas pelas Tabelas 4 e 5 de Castro e Nunes (2014) que indica 9 reeleitos de um total de 50 parlamentares, o que equivale a 18%.</p>
				</fn>
				<fn fn-type="other" id="fn58">
					<label>25</label>
					<p>E isso pode ser calculado a partir da raz&#x00E3;o de chance, que &#x00E9; calculada pela divis&#x00E3;o entre as chances de reelei&#x00E7;&#x00E3;o de cada grupo, no caso, 1,9/0,22. Ou seja, candidatos n&#x00E3;o envolvidos em esc&#x00E2;ndalos de corrup&#x00E7;&#x00E3;o tem cerca de 8 vezes mais chance de serem reeleitos quando comparados com os deputados citados nos esquemas do mensal&#x00E3;o e/ou sanguessugas, assim como mensurado por Castro e Nunes (2014).</p>
				</fn>
				<fn fn-type="other" id="fn59">
					<label>26</label>
					<p>Para 
						<xref ref-type="bibr" rid="B22">Garson (2011)</xref>, o teste omnibus pode ser interpretado como um teste para a capacidade conjunta de todos os preditores do modelo preverem a vari&#x00E1;vel resposta (dependente). Um resultado significativo indica que o ajuste est&#x00E1; adequado aos dados, sugerindo que pelo menos um dos preditores &#x00E9; significativamente relacionado com a vari&#x00E1;vel resposta.
					</p>
				</fn>
				<fn fn-type="other" id="fn60">
					<label>27</label>
					<p>No original, &#x201C;a aloca&#x00E7;&#x00E3;o bem-sucedida de recursos particularistas 
						<italic>(pork)</italic> n&#x00E3;o apresenta, diferentemente do que era esperado, associa&#x00E7;&#x00E3;o positiva com reelei&#x00E7;&#x00E3;o. O resultado parece ser nulo e n&#x00E3;o relevante para explicar as chances de reelei&#x00E7;&#x00E3;o, em 2006, tamb&#x00E9;m quando vari&#x00E1;veis socioecon&#x00F4;micas e institucionais s&#x00E3;o inclu&#x00ED;das no modelo&#x201D; (Castro &amp; Nunes, 2014, p. 42).
					</p>
				</fn>
				<fn fn-type="other" id="fn61">
					<label>28</label>
					<p>O m&#x00E9;todo de m&#x00E1;xima verossimilhan&#x00E7;a &#x00E9; um processo iterativo que procura ajustar o modelo atrav&#x00E9;s de v&#x00E1;rias repeti&#x00E7;&#x00F5;es. No entanto, algumas vezes o modelo simplesmente n&#x00E3;o converge. Isso pode acontecer por v&#x00E1;rios motivos, desde problemas nos algoritmos utilizados para estimar a fun&#x00E7;&#x00E3;o de liga&#x00E7;&#x00E3;o at&#x00E9; a distribui&#x00E7;&#x00E3;o fortemente assim&#x00E9;trica das vari&#x00E1;veis independentes.</p>
				</fn>
				<fn fn-type="other" id="fn62">
					<label>29</label>
					<p>Existe um debate sobre as vantagens e limita&#x00E7;&#x00F5;es do r
						<sup>2</sup> como medida s&#x00ED;ntese para avaliar a qualidade do ajuste dos modelos de regress&#x00E3;o linear. Salvo melhor ju&#x00ED;zo, 
						<xref ref-type="bibr" rid="B37">King (1986)</xref> representa o primeiro alerta sistem&#x00E1;tico sobre o assunto na pesquisa emp&#x00ED;rica em Ci&#x00EA;ncia Pol&#x00ED;tica. 
						<xref ref-type="bibr" rid="B18">Figueiredo Filho, Silva J&#x00FA;nior e Rocha (2012)</xref> apresentam uma discuss&#x00E3;o pedag&#x00F3;gica sobre o tema.
					</p>
				</fn>
				<fn fn-type="other" id="fn63">
					<label>30</label>
					<p>
						<xref ref-type="bibr" rid="B27">Hair 
							<italic>et al.</italic> (2009)
						</xref> afirmam que o ajuste do modelo log&#x00ED;stico pode ser avaliado a partir de dois principais procedimentos: (1) os pseudo r2s, similarmente &#x00E0; regress&#x00E3;o linear e (2) estimar a capacidade preditiva do modelo.
					</p>
				</fn>
				<fn fn-type="other" id="fn64">
					<label>31</label>
					<p>Existem ainda o McFadden&#x2019;s pseudo R
						<sup>2</sup>, McKelvey e Savoina pseudo R
						<sup>2</sup>, McFadden pseudo R
						<sup>2</sup> ajustado, Cragg e Uhler pseudo R
						<sup>2</sup> e Efron pseudo R
						<sup>2</sup>. Para o leitor interessado em aprofundar seus conhecimentos sobre o assunto ver 
						<xref ref-type="bibr" rid="B26">Hagle e Mitchell (1992)</xref> e 
						<xref ref-type="bibr" rid="B46">Menard (2000)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn65">
					<label>32</label>
					<p>Esta se&#x00E7;&#x00E3;o foi baseada em 
						<xref ref-type="bibr" rid="B62">Schwab (2002)</xref>.
					</p>
				</fn>
				<fn fn-type="other" id="fn66">
					<p>A produ&#x00E7;&#x00E3;o desse manuscrito foi viabilizada atrav&#x00E9;s do patroc&#x00ED;nio fornecido pelo Centro Universit&#x00E1;rio Internacional Uninter &#x00E0; 
						<italic>Revista de Sociologia e Pol&#x00ED;tica</italic>.
					</p>
				</fn>
			</fn-group>
			<app-group>
				<app id="app2">
					<title>Ap&#x00EA;ndice</title>
					<p>Nesta se&#x00E7;&#x00E3;o apresentamos algumas informa&#x00E7;&#x00F5;es que podem auxiliar o pesquisador na interpreta&#x00E7;&#x00E3;o dos coeficientes da regress&#x00E3;o log&#x00ED;stica. Em particular, examinamos a interpreta&#x00E7;&#x00E3;o da raz&#x00E3;o de chance. Al&#x00E9;m disso, listamos algumas ferramentas de aprendizagem.</p>
					<p>&#x2022; Entendendo a raz&#x00E3;o de chance (
						<italic>odds ratio</italic>)
						<xref ref-type="fn" rid="fn65">
							<sup>32</sup>
						</xref>
					</p>
					<p>O termo raz&#x00E3;o de chance n&#x00E3;o &#x00E9; t&#x00E3;o difundido na pesquisa aplicada em Ci&#x00EA;ncia Pol&#x00ED;tica como m&#x00E9;dia ou probabilidade. Em geral, como o pesquisador est&#x00E1; comparando grupos/categorias, ele se interessa em analisar que grupo/categoria tem mais chance de ocorrer em rela&#x00E7;&#x00E3;o ao outro grupo/categoria. Considere o seguinte exemplo: suponha que a probabilidade (p) de ocorr&#x00EA;ncia um determinado evento &#x00E9; de 0,9. Dessa forma, ao se calcular o complementar, q&nbsp;= 1 - p, ent&#x00E3;o 1 - 0,9 = 0,1. Chance &#x00E9; a divis&#x00E3;o da probabilidade de ocorr&#x00EA;ncia (p) pela probabilidade de n&#x00E3;o ocorr&#x00EA;ncia (q). Ent&#x00E3;o, 0,9/0,1 = 9. Afirma-se, ent&#x00E3;o, que a chance de sucesso &#x00E9; 9 para 1. Por sua vez, a chance de fracasso d&#x00E1;-se por 0,1/0,9 = 0,11. Dizemos, ent&#x00E3;o, que a chance de fracasso &#x00E9; de 1 para 9. Diferente da probabilidade que apenas pode assumir valores entre 0 e 1, a chance pode variar de 0 a infinito. Quando a probabilidade de ocorr&#x00EA;ncia de um evento &#x00E9; maior do que a probabilidade de n&#x00E3;o ocorr&#x00EA;ncia, a chance ser&#x00E1; maior do que 1. Quando a probabilidade de n&#x00E3;o ocorr&#x00EA;ncia &#x00E9; maior, a chance ser&#x00E1; menor do que 1. Quando as probabilidades s&#x00E3;o iguais (ex. lan&#x00E7;amento de uma moeda), a chance &#x00E9; igual a 1. Dado os prop&#x00F3;sitos pedag&#x00F3;gicos deste artigo, &#x00E9; importante replicar os dados de Schawb (2002) para melhor entender esse conceito (
						<xref ref-type="table" rid="t3a">Tabela 1A</xref>).
					</p>
					<table-wrap id="t3a">
						<label>Tabela 1A</label>
						<caption>
							<title>Frequ&#x00EA;ncia</title>
						</caption>
						<table frame="hsides" rules="groups">
							<colgroup width="33%">
								<col/>
								<col/>
								<col/>
							</colgroup>
							<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
								<tr>
									<th align="left" valign="top">Pena</th>
									<th align="center" valign="top">N</th>
									<th align="center" valign="top">%</th>
								</tr>
							</thead>
							<tbody style="border-bottom: thin solid; border-color: #000000">
								<tr>
									<td align="left" valign="top">Pena de morte</td>
									<td align="center" valign="top">50</td>
									<td align="center" valign="top">34</td>
								</tr>
								<tr>
									<td align="left" valign="top">Pris&#x00E3;o perp&#x00E9;tua</td>
									<td align="center" valign="top">97</td>
									<td align="center" valign="top">66</td>
								</tr>
								<tr>
									<td align="left" valign="top">
										<bold>Total</bold>
									</td>
									<td align="center" valign="top">
										<bold>147</bold>
									</td>
									<td align="center" valign="top">
										<bold>100,0</bold>
									</td>
								</tr>
							</tbody>
						</table>
						<table-wrap-foot>
							<attrib>Fonte: 
								<xref ref-type="bibr" rid="B62">Schwab (2002)</xref>.
							</attrib>
						</table-wrap-foot>
					</table-wrap>
					<p>A 
						<xref ref-type="table" rid="t3a">Tabela 1A</xref> mostra que 34% dos presos foram condenados &#x00E0; pena de morte (n = 50/147). Isso quer dizer que a probabilidade de ocorr&#x00EA;ncia desse evento &#x00E9; de 0,34. Por sua vez, a chance de ser condenado &#x00E0; pena capital &#x00E9; de 0,516 (50/97). Outra forma de dizer &#x00E9; que se tem aproximadamente a metade da chance de ser condenado &#x00E0; pena capital em rela&#x00E7;&#x00E3;o a passar o resto da vida na pris&#x00E3;o. Por fim, &#x00E9; poss&#x00ED;vel inverter a interpreta&#x00E7;&#x00E3;o e considerar que a pris&#x00E3;o perp&#x00E9;tua &#x00E9; cerca de duas vezes mais prov&#x00E1;vel do que a pena de morte.
					</p>
					<p>At&#x00E9; ent&#x00E3;o n&#x00E3;o se tem nenhuma vari&#x00E1;vel independente. O que o modelo log&#x00ED;stico vai informar &#x00E9; o impacto de uma determinada vari&#x00E1;vel sobre a chance de ocorr&#x00EA;ncia de vari&#x00E1;vel dependente. Por exemplo, considere a rela&#x00E7;&#x00E3;o entre cor e tipo de senten&#x00E7;a (
						<xref ref-type="table" rid="t4a">Tabela 2A</xref>).
					</p>
					<table-wrap id="t4a">
						<label>Tabela 2A</label>
						<caption>
							<title>Tipo de pena por cor</title>
						</caption>
						<table frame="hsides" rules="groups">
							<colgroup width="25%">
								<col/>
								<col/>
								<col/>
								<col/>
							</colgroup>
							<thead style="border-top: thin solid; border-bottom: 2px solid; border-color: #000000">
								<tr>
									<th align="left" valign="top">Pena</th>
									<th align="center" valign="top">Negros</th>
									<th align="center" valign="top">N&#x00E3;o Negros</th>
									<th align="center" valign="top">Total</th>
								</tr>
							</thead>
							<tbody style="border-bottom: thin solid; border-color: #000000">
								<tr>
									<td align="left" valign="top">Pena de morte</td>
									<td align="center" valign="top">28</td>
									<td align="center" valign="top">22</td>
									<td align="center" valign="top">50</td>
								</tr>
								<tr>
									<td align="left" valign="top">Pris&#x00E3;o perp&#x00E9;tua</td>
									<td align="center" valign="top">45</td>
									<td align="center" valign="top">52</td>
									<td align="center" valign="top">97</td>
								</tr>
								<tr>
									<td align="left" valign="top">
										<bold>Total</bold>
									</td>
									<td align="center" valign="top">
										<bold>73</bold>
									</td>
									<td align="center" valign="top">
										<bold>74</bold>
									</td>
									<td align="center" valign="top">
										<bold>147</bold>
									</td>
								</tr>
							</tbody>
						</table>
						<table-wrap-foot>
							<attrib>Fonte: 
								<xref ref-type="bibr" rid="B62">Schwab (2002)</xref>.
							</attrib>
						</table-wrap-foot>
					</table-wrap>
					<p>&#x00C9; poss&#x00ED;vel ent&#x00E3;o calcular a chance para cada grupo espec&#x00ED;fico: negros e n&#x00E3;o-negros. Para os negros tem-se 28/45 = 0,622. Para os n&#x00E3;o-negros tem-se 22/52 = 0,423. O impacto de ser negro pode ser representado pela divis&#x00E3;o da chance do negro receber pena de morte (0,622) pela chance de um n&#x00E3;o negro receber a pena capital (0,423). 0,622/0,423 = 1,47. Para interpretar: a) negros tem 1,47 mais chance de receber a pena de morte do que n&#x00E3;o negros; b) ser negro aumenta 47% a chance de receber a pena capital (1,47-1*100).</p>
					<sec>
						<title>Ferramentas de aprendizagem</title>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.icpsr.umich.edu/icpsrweb/sumprog/">http://www.icpsr.umich.edu/icpsrweb/sumprog/</ext-link>
						</p>
						<p>No plano internacional, o 
							<italic>Summer Program in Quantitative Methods of Social Research</italic> (ICPRS) &#x00E9; uma das principais iniciativas na difus&#x00E3;o de m&#x00E9;todos e t&#x00E9;cnicas de pesquisa.
						</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.fafich.ufmg.br/~mq/index.html">http://www.fafich.ufmg.br/~mq/index.html</ext-link>
						</p>
						<p>Curso intensivo de Metodologia Quantitativa em Ci&#x00EA;ncias Humanas. &#x00C9; o curso mais tradicional no ensino de m&#x00E9;todos e t&#x00E9;cnicas de pesquisa em Ci&#x00EA;ncias Sociais no Brasil.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://summerschool.ipsa.org/">http://summerschool.ipsa.org/</ext-link>
						</p>
						<p>Curso de ver&#x00E3;o organizado pela Associa&#x00E7;&#x00E3;o Internacional de Ci&#x00EA;ncia Pol&#x00ED;tica, Departamento de Ci&#x00EA;ncia Pol&#x00ED;tica e o Instituto de Rela&#x00E7;&#x00F5;es da Universidade de S&#x00E3;o Paulo (USP).</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://gking.harvard.edu/">http://gking.harvard.edu/</ext-link>
						</p>
						<p>Gary King disponibiliza artigos sobre metodologia, 
							<italic>softwares</italic> espec&#x00ED;ficos e bancos de dados para pesquisadores interessados em fazer replica&#x00E7;&#x00F5;es.
						</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://faculty.chass.ncsu.edu/garson/PA765/statnote.htm">http://faculty.chass.ncsu.edu/garson/PA765/statnote.htm</ext-link>
						</p>
						<p>David Garson apresenta diferentes t&#x00F3;picos em estat&#x00ED;stica multivariada utilizando o Statistical Package for Social Sciences. Ao final de cada se&#x00E7;&#x00E3;o, tem-se uma bibliografia sugerida que pode ser utilizada como refer&#x00EA;ncia para ganhar mais profundidade no assunto.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.statsoft.com/textbook/">http://www.statsoft.com/textbook/</ext-link>
						</p>
						<p>Apresenta diferentes t&#x00E9;cnicas multivariadas utilizando o software Statistica.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.ats.ucla.edu/stat/">http://www.ats.ucla.edu/stat/</ext-link>
						</p>
						<p>S&#x00ED;tio eletr&#x00F4;nico da Universidade da Calif&#x00F3;rnia (UCLA) especializado em t&#x00E9;cnicas multivariadas. Aqui o usu&#x00E1;rio encontra aplica&#x00E7;&#x00F5;es de diferentes softwares (SAS, SPSS, STATA, R, etc.), inclusive com v&#x00ED;deo aulas e tutoriais.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.socr.ucla.edu/SOCR.html">http://www.socr.ucla.edu/SOCR.html</ext-link>
						</p>
						<p>Nesse endere&#x00E7;o o leitor encontra jogos, aplica&#x00E7;&#x00F5;es, an&#x00E1;lises, entre outras ferramentas relacionadas ao ensino de Estat&#x00ED;stica e diferentes t&#x00E9;cnicas de pesquisa.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://pan.oxfordjournals.org/">http://pan.oxfordjournals.org/</ext-link>
						</p>
						<p>Political Analysis &#x00E9; um dos peri&#x00F3;dicos mais influentes da Ci&#x00EA;ncia Pol&#x00ED;tica contempor&#x00E2;nea e publica artigos na &#x00E1;rea de metodologia.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.amstat.org/publications/jse/">http://www.amstat.org/publications/jse/</ext-link>
						</p>
						<p>Peri&#x00F3;dico especializado na divulga&#x00E7;&#x00E3;o de t&#x00E9;cnicas de ensino e aprendizagem de Estat&#x00ED;stica.</p>
						<p>
							<ext-link ext-link-type="uri" xlink:href="http://www.politicahoje.ufpe.br/index.php/politica">http://www.politicahoje.ufpe.br/index.php/politica</ext-link>
						</p>
						<p>A Revista Pol&#x00ED;tica Hoje do Departamento de Ci&#x00EA;ncia Pol&#x00ED;tica da UFPE publicou recentemente uma edi&#x00E7;&#x00E3;o especial dedicada a Metodologia e Epistemologia em Ci&#x00EA;ncia Pol&#x00ED;tica e Rela&#x00E7;&#x00F5;es Internacionais.</p>
					</sec>
				</app>
			</app-group>
		</back>
	</sub-article-->
</article>