<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Healthcare Economist &#187; Econometrics</title>
	<atom:link href="http://healthcare-economist.com/category/econometrics/feed/" rel="self" type="application/rss+xml" />
	<link>http://healthcare-economist.com</link>
	<description></description>
	<lastBuildDate>Fri, 25 May 2012 15:31:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Fixed Effects Logit Estimator and the Incidental Parameter Problem</title>
		<link>http://healthcare-economist.com/2012/05/23/fixed-effects-logit-estimator-and-the-incidental-parameter-problem/</link>
		<comments>http://healthcare-economist.com/2012/05/23/fixed-effects-logit-estimator-and-the-incidental-parameter-problem/#comments</comments>
		<pubDate>Wed, 23 May 2012 13:18:11 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Fixed Effects]]></category>
		<category><![CDATA[Incidental Parameter Problem]]></category>
		<category><![CDATA[Logit]]></category>
		<category><![CDATA[Probit]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=6454</guid>
		<description><![CDATA[Fixed effects (FE) regressions are a useful tool for controlling for time-invariant factors in a regression specification.  When using a linear OLS model, FE represent the average value of the dependent variable for that individual after controlling for covariates.  Estimating a fixed effects model for non-linear regressions, however, can be problematic. For instance, if you [...]]]></description>
			<content:encoded><![CDATA[<p>Fixed effects (FE) regressions are a useful tool for controlling for time-invariant factors in a regression specification.  When using a linear OLS model, FE represent the average value of the dependent variable for that individual after controlling for covariates.  Estimating a fixed effects model for non-linear regressions, however, can be problematic.</p>
<p>For instance, if you try to estimate the fixed effects coefficients in a probit model, you will introduce an incidental parameters problem.  Assume that the panel data has <em>N</em> individuals over <em>T</em> time periods.  If <em>T</em> is fixed, as <em>N</em> grows large (i.e., <em>N</em>→∞) your covariate estimates (<em>β</em>) become biased.  This occurs because the number of &#8220;nuisance parameters&#8221; grow quickly as <em>N</em> increases.</p>
<p>There does exist a &#8220;fixed effects logit estimator&#8221;, but this estimator does not actually use a fixed effects method.  Rather it is a conditional maximum likelihood estimator (cMLE).  In the two period model, it conditions on the fact that the event occurred in one or the other time period.</p>
<ul>
<li>Pr(y<sub>i1</sub>=1|y<sub>i1</sub>+y<sub>i2</sub>=1) = 1-<em>F(Δxβ)</em></li>
<li>Pr(y<sub>i2</sub>=1|y<sub>i1</sub>+y<sub>i2</sub>=1) = <em>F(Δxβ)</em></li>
</ul>
<p>&#8220;The conditional-likelihood estimator is thus equivalent to a logit estimator of the dependent variable <em>1(Δy=1)</em> on the independent variables <em>Δx</em> for the subsample of observations satisfying <em>y<sub>i1</sub> + y<sub>i2</sub>=1</em>.&#8221;</p>
<p><a href="http://www.sciencedirect.com/science/article/pii/S016517659700044X">Abrevaya (1996)</a> provides an example of how this bias can occur in a two period example and explains the conditional logit model in more detail.</p>
<ul>
<li>Jason Abrevaya (1996) &#8220;<a href="http://www.sciencedirect.com/science/article/pii/S016517659700044X">The equivalence of two estimators of the fixed-effects logit model</a>&#8221; <em>Economic Letters</em>  Volume 55, Issue 1, 15 August 1997, Pages 41–4</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2012/05/23/fixed-effects-logit-estimator-and-the-incidental-parameter-problem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Attrition Bias</title>
		<link>http://healthcare-economist.com/2012/05/02/attrition-bias/</link>
		<comments>http://healthcare-economist.com/2012/05/02/attrition-bias/#comments</comments>
		<pubDate>Wed, 02 May 2012 07:22:36 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Attrition Bias]]></category>
		<category><![CDATA[Toolkit]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=6442</guid>
		<description><![CDATA[If you are evaluating the treatment effect of a policy or medical intervention, does it matter if some of your subjects leave the sample? In many cases, the answer is &#8216;yes&#8217;. The Problem As outlined in Grasdal (2001), the effect of the treatment is simply: Δ = E(Y&#124;X, T=1) − E(Y&#124;X, T=0) However, in some [...]]]></description>
			<content:encoded><![CDATA[<p>If you are evaluating the treatment effect of a policy or medical intervention, does it matter if some of your subjects leave the sample? In many cases, the answer is &#8216;yes&#8217;.</p>
<h3>The Problem</h3>
<p>As outlined in <a href="http://onlinelibrary.wiley.com/doi/10.1002/hec.628/abstract">Grasdal (2001)</a>, the effect of the treatment is simply:</p>
<ul>
<li>Δ = E(Y|X, T=1) − E(Y|X, T=0)</li>
</ul>
<p>However, in some cases we may not observe Y. For instance, if there is attrition in the study, we will not observe their outcomes. Thus, we can decompose the two components from the equation above as follows: The effect of treatment with attrition is:</p>
<ul>
<li>E(Y|X, T=1) = p<sub>T</sub>E(Y|X, T=1, A=0) + (1-p<sub>T</sub>)E(Y|X, T=1, A=1)</li>
<li>E(Y|X, T=0) = p<sub>C</sub>E(Y|X, T=0, A=0) + (1-p<sub>C</sub>)E(Y|X, T=0, A=1)</li>
</ul>
<p>where p<sub>T</sub> is the probability someone in the treatment group drops out of the sample (p<sub>T</sub>=p(A=0|X, T=1) and p<sub>C</sub> is the probability someone in the control group drops out of the sample (p<sub>C</sub>=p(A=0|X, T=0).</p>
<p>Rearranging terms we get:</p>
<ul>
<li>Δ = [E(Y|X, T=1, A=0)-E(Y|X, T=0, A=0)] + p<sub>T</sub>[E(Y|X, T=1, A=0)-E(Y|X, T=1, A=1)] + p<sub>C</sub>[E(Y|X, T=0, A=1)-E(Y|X, T=0, A=0)]</li>
</ul>
<p>The first term in brackets is what we observe. The second term in brackets is the difference between is the outcome in the treatment group for the attrition and non-attrition group; the third term in brackets gives the difference between is the outcome in the control group for the attrition and non-attrition group. With random attrition, the two expressions inside the square brackets will cancel out. If attrition is random, then estimating the treatment effect using the first equation will produce unbiased estimates.</p>
<h3>Potential Solutions</h3>
<p>If one knows the source of the attrition bias, one can explicitly model the source of the attrition. <a href="http://digitalcommons.unl.edu/cgi/viewcontent.cgi?article=1044&amp;context=famconfacpub">Explicit models</a> are typically sample selection model in which two simultaneous regression<br />
models are calculated. &#8220;The first model is a regression model that addresses the research question, with the hypotheses of the study being examined by the regression of the dependent variable on the key independent variables in the study. The second model includes the variables that are causing attrition, with the dependent variable being a dichotomous variable indicating either continued participation or nonparticipation in the study. The error terms of the substantive dependent variable in the first regression model and the participation dependent variable in the second regression model are correlated. A significant correlation between the two error terms indicates attrition bias.&#8221;</p>
<p>If the source of the bias is unknown, one can use the <a href="http://healthcare-economist.com/2008/05/28/sample-selection-vs-two-part-model/">Heckman selection model</a>. The first step of the Heckman selection model &#8220;&#8230;not only tests for attrition bias but also creates an outcome variable, which Heckman calls λ (lambda). Thus, a λ value is computed for all cases in the study, and it represents the proxy variable that explains the causation of attrition in the study&#8230;The second step of Heckman&#8217;s procedure is to merge the λ value of each participant into the larger data set and then include it&#8230;in the regression equation that is used to test the hypotheses in the study. Including λ in the equation solves the problem of specification error and leads to more accurate regression coefficients.&#8221;</p>
<h3>Empirical Investigation</h3>
<p>A <a href="http://onlinelibrary.wiley.com/doi/10.1002/hec.628/abstract">study by Grasdal</a> looks at attrition in a randomized field trial of a rehabilitation programme designed to bring long-term sick listed workers with musculoskeletal problems back to work in <a href="http://en.wikipedia.org/wiki/Bergen">Bergen, Norway</a>. In this case, they found that &#8220;Both the parametric and the semi-parametric sample estimators that were considered indicated that sample attrition biased outcome data regarding posttreatment earnings, while the data regarding sick leave status remained unbiased. The sample selection estimators of post-treatment earnings perform quite well in terms of correcting for attrition bias and estimating treatment effects not very different from the experimental benchmark.&#8221;</p>
<p>&#8230;The analysis also demonstrates an inherent paradox in the ‘common support’ approach, which prescribes exclusion from the analysis of observations outside of common support for the selection probability. The more important treatment status is as a determinant of attrition, the larger is the proportion of treated with support for the selection probability outside the range, for which comparison with untreated counterparts is possible.&#8221;</p>
<p>Source:</p>
<ul>
<li>Astrid Grasdal, &#8220;<a href="onlinelibrary.wiley.com/doi/10.1002/hec.628/abstract">The Performance of Sample Selection Estimators to Control for Attrition Bias</a>&#8221; Health Economics. Volume 10, Issue 5, Article first published online: 16 JUL 2001</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2012/05/02/attrition-bias/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Halbert White, Jr., 1950-2012</title>
		<link>http://healthcare-economist.com/2012/04/02/a-sad-day/</link>
		<comments>http://healthcare-economist.com/2012/04/02/a-sad-day/#comments</comments>
		<pubDate>Mon, 02 Apr 2012 13:52:48 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Hal White]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=6367</guid>
		<description><![CDATA[On Saturday, UCSD Economics Professor Dr. Hal White passed away after an extended struggle with cancer.  This is a sad day as Hal was one of my former professors.  Here is an except from obituary written by Dr. Jim Hamilton regarding Dr. White&#8217;s work. Hal was one of the world&#8217;s leading econometricians. One of his core beliefs [...]]]></description>
			<content:encoded><![CDATA[<p>On Saturday, UCSD Economics Professor Dr. Hal White passed away after an extended struggle with cancer.  This is a sad day as Hal was one of my former professors.  Here is an except from <a href="http://www.econbrowser.com/archives/2012/04/halbert_l_white.html">obituary written by Dr. Jim Hamilton</a> regarding Dr. White&#8217;s work.</p>
<p><em>Hal was one of the world&#8217;s leading econometricians. One of his core beliefs was that the models and assumptions that we bring to the data are inevitably flawed and misspecified in some way. It might seem that if you believe that, there&#8217;s no hope in trying to do econometrics. But some of Hal&#8217;s most remarkable discoveries concerned how to form valid inference even if part of what you assumed was fundamentally wrong.</em></p>
<p><em>An example arises in ordinary regression analysis, in which a common assumption is that the variance of the regression model&#8217;s error is the same for all observations. Suppose that assumption is wrong, and instead the variance depends in an unknown way on the various explanatory variables. Hal found that it is possible to <a href="http://dss.ucsd.edu/~hwhite/pub_files/hwcv-007.pdf">characterize how that dependence</a> would affect the reliability of the inference from the regression, and construct modified t-statistics or F-statistics that take this into account. This was such a useful contribution that it is now a standard option a user can easily select in any decent regression software package. Hal once lamented to me that this was an example of a contribution that became so successful and widespread that people forgot who came up with it in the first place. Hal&#8217;s proposed adjustments are often described as &#8220;robust standard errors&#8221; or &#8220;heteroskedasticity-consistent standard errors&#8221;, though I have always introduced them to my students as &#8220;White standard errors&#8221;.</em></p>
<p><em>Hal also showed that this idea generalizes much more broadly, as spelled out in his classic article, <a href="http://dss.ucsd.edu/~hwhite/pub_files/hwcv-010.pdf">Maximum Likelihood Estimation of Misspecified Models</a>. The maximum likelihood estimator (affectionately known as the &#8220;MLE&#8221;) refers to a particular estimate of parameters that is derived from the claim that the researcher knows the family from which the true probability distribution that generated the data comes. Hal&#8217;s remarkable contribution here was to examine the properties of that inference if you have assumed the wrong class of probability distributions. He referred to that procedure (using an MLE that is based on an incorrect assumption about the probability distribution) as &#8220;quasi maximum likelihood estimation.&#8221; Again establishing the properties of such inference seems like (and is!) an astounding result. But when you get into the math, you discover that it makes perfect sense. For example, one could assume (mistakenly, perhaps), that the error terms in the regression model came from a Normal distribution with mean zero and constant variance. If your assumptions were correct, then the MLE turns out to be the usual formula for regression estimation. However, even if your assumption about the probability distribution is wrong, one can show that what you were calling the MLE is usually still giving you a decent estimate of something, namely, an estimate of the best prediction of y if you want to base your prediction on a linear function of x. In fact, White&#8217;s robust standard errors for ordinary regression prove to be a special case of his general results for quasi maximum likelihood estimation.</em></p>
<p><em>Hal had a host of other very fundamental contributions, ranging from the recognition that neural networks are essentially a statistical inference problem, elegant contributions to asymptotic theory, any number of extremely useful specification tests, and his most recent interest in some very deep ideas about causality and inference. There are I suspect a great many papers by Hal and his co-authors that have not yet been published, but soon will be, as he remained astonishingly productive up to the end, writing papers faster than the journals could publish them</em>.</p>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2012/04/02/a-sad-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Add to Your Skills Toolkit: Bootstrapping Confidence Intervals</title>
		<link>http://healthcare-economist.com/2012/03/06/add-to-your-skills-toolkit-bootstrapping-confidence-intervals/</link>
		<comments>http://healthcare-economist.com/2012/03/06/add-to-your-skills-toolkit-bootstrapping-confidence-intervals/#comments</comments>
		<pubDate>Tue, 06 Mar 2012 08:30:50 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Bootstrap]]></category>
		<category><![CDATA[Confidence Interval]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=6207</guid>
		<description><![CDATA[In previous posts, I have explained how to create bootstrap estimates for a variety of statistics.  Doing so is fairly simple and involves a 3 step procedure: Step 1: Using the observe data, create m boostrap data sets by using  random resampling with replacement. Step 2: Calculate the statistic of interest for each bootstrap data [...]]]></description>
			<content:encoded><![CDATA[<p>In previous posts, I have explained how to create <a href="http://healthcare-economist.com/2007/05/24/bootstrapping/">bootstrap estimates for a variety of statistics</a>.  Doing so is fairly simple and involves a 3 step procedure:</p>
<ul>
<li><strong>Step 1</strong>: Using the observe data, create <em>m</em> boostrap data sets by using  random resampling <em>with replacement</em>.</li>
<li><strong>Step 2</strong>: Calculate the statistic of interest for each bootstrap data set.</li>
<li><strong>Step 3</strong>: The bootstrap estimate of the statistic of interest is the average value from Step 2 across all bootstrap samples.</li>
</ul>
<p>One question that has not yet been answered is how to calculate the confidence interval for the statistic of interest.  A paper by <a href="http://gcrc.labiomed.org/Biostat/Education/Case%20studies%202005/session2/Haukoos%20and%20Lewis%20Bootstrapping.pdf">Haukoos and Lewis</a> describes five methods for computing bootstrap confidence intervals: i) normal approximation, ii) percentile, iii) bias-corrected(BC), iv) bias-corrected and accelerated (BC<sub>a</sub>) and v) approximate bootstrap confidence (ABC) methods.</p>
<p>The normal approximation method is calculated as follows:</p>
<ul>
<li>original statistic +/- <em>Z</em>* (standard Error)</li>
</ul>
<p>For instance, for a 95% confidence interval, <em>Z</em>=1.96.  Another alternative is to use the percentile method.  To calculate, the percentile confidence intervals for a 95% CI, one simple takes calculates the 2.5 and 97.5 percentiles for the distribution of statistics calculated in Step 2 of the bootstrap procedures.</p>
<p>The other bootstrap CI methods are a bit more complex.</p>
<p>The BC<sub>a</sub> method adjusts for bias in the bootstrapped sampling distributions relative to the actual sampling distribution, and is thus considered a substantial improvement over the percentile method. The BC<sub>a</sub> confidence interval is an adjustment of the percentiles used in the percentile method based upon the calculation of two coefficients<br />
called ‘‘bias correction’’ and ‘‘acceleration.’’ The bias correction coefficient adjusts for the skewness in the bootstrap sampling distribution. If the bootstrap sampling distribution is perfectly symmetric, then the bias correction will be zero. The acceleration coefficient adjusts for nonconstant variances within the resampled data sets. The ABC method is an approximation of the BC<sub>a</sub> method that requires fewer resampled data sets than the BC<sub>a</sub> method.</p>
<p><span id="more-6207"></span></p>
<h3>Programming Bootstrap CI</h3>
<p>In Stata, programming the bootstrap CI using the base-corrected and accelerated method is straightforward.</p>
<blockquote><p><span style="font-family: courier;"> centile var1<br />
bs ‘‘centile var1’’ ‘‘r(c_1)’’,<br />
rep(2500)<br />
</span></p></blockquote>
<p>The &#8220;centile&#8221; command calculates the median value for the variable <em>var1</em>.  The &#8220;bs&#8221; command calculates the bootstrapped CI for the median value of var 1, where &#8220;r(c_1) refers to the reference statistics for which the 95% CI will be calculated.  The rep(2500) indicates that there will be 2500 re-sampled data sets.</p>
<p>Calculating the boostrapped correlation measure for two variables is just as easy.</p>
<blockquote><p><span style="font-family: courier;"> spearman var1 var2<br />
bs ‘‘spearman var1 var 2’’ ‘‘r(rho)’’,<br />
rep(2500)<br />
</span></p></blockquote>
<p>Programming the bootstrap in SAS relies on the <span style="font-family: courier;">%boot</span> macro. Additional information on bootstrap programming in SAS is available <a href="http://support.sas.com/kb/24/982.html">here</a>.</p>
<p>Source:</p>
<ul>
<li>Haukoos JS, Lewis RJ. <a href="http://gcrc.labiomed.org/Biostat/Education/Case%20studies%202005/session2/Haukoos%20and%20Lewis%20Bootstrapping.pdf">Advanced statistics: bootstrapping confidence intervals for statistics with &#8220;difficult&#8221; distributions</a>.  Acad Emerg Med. 2005 Apr;12(4):360-5.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2012/03/06/add-to-your-skills-toolkit-bootstrapping-confidence-intervals/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Add to Your Skills Toolkit: The Oaxaca Decomposition</title>
		<link>http://healthcare-economist.com/2012/01/26/add-to-your-skills-toolkit-the-oaxaca-decomposition/</link>
		<comments>http://healthcare-economist.com/2012/01/26/add-to-your-skills-toolkit-the-oaxaca-decomposition/#comments</comments>
		<pubDate>Thu, 26 Jan 2012 16:15:47 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Economics - General]]></category>
		<category><![CDATA[Labor Economics]]></category>
		<category><![CDATA[Oaxaca Decomposition]]></category>
		<category><![CDATA[Regional Variation]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=6150</guid>
		<description><![CDATA[Suppose you look at health care spending in two different regions and observe a significant difference.  You may want to know what the cause of this difference is.  Is it because one region has a mix of people who are sicker; or is because the reason treat patients with a given disease more intensively? One [...]]]></description>
			<content:encoded><![CDATA[<p>Suppose you look at health care spending in two different regions and observe a significant difference.  You may want to know what the cause of this difference is.  Is it because one region has a mix of people who are sicker; or is because the reason treat patients with a given disease more intensively?</p>
<p>One way to answer this question is to use the Oaxaca decomposition.  This approach was originally formulated by <a href="http://www-bcf.usc.edu/~ridder/Lnotes/Undeconometrics/Transparanten/Wagedecomp.pdf">Ronald Oaxaca</a>. <a href="http://siteresources.worldbank.org/INTPAH/Resources/Publications/459843-1195594469249/HealthEquityCh12.pdf">This document</a> provides a nice overview of how to use the Oaxaca Decomposition and I apply that framework to the health spending case.</p>
<h3>Differences in Health Spending</h3>
<p>Assume that there are two regions: Region A and Region B. The spending for the two regions can be modeled using a linear regression framework:</p>
<ul>
<li>Y<sup>A</sup> = β<sup>A</sup>X + ε<sup>A</sup></li>
<li>Y<sup>B</sup> = β<sup>B</sup>X + ε<sup>B</sup></li>
</ul>
<p>The <em>Y</em> term represents spending and the variable <em>X</em> represents the patient&#8217;s health status. Health status could be measured as a vector of factors or as a single indicator (e.g., healthy or sick). The term <em>β</em> describes much an area spending on medical resources to treat a patient with a health status of <em>X</em>. Thus, average difference in spending per person the two regions is:</p>
<ul>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = β<sup>A</sup>X<sup>A</sup> &#8211; β<sup>B</sup>X<sup>B</sup></li>
</ul>
<p>where <em>X<sup>A</sup></em> is the average case mix in the area.</p>
<h3>Determinants of Health Spending Differentials</h3>
<p>Now the question is whether case mix or spending practices conditional on case mix is the key driver of the differences in spending between regions A and B. One can differentiate these two components using the following Oaxaca Decomposition:</p>
<ul>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = ΔXβ<sup>B</sup> + ΔβX<sup>A</sup></li>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = ΔXβ<sup>A</sup> + ΔβX<sup>B</sup></li>
</ul>
<p>In the first equation, the differences in health status (<em>X</em>&#8216;s)are weighted by the coefficients for region B and the differences in the coefficients are weighted by the <em>X</em>’s from region A, whereas in the second, the differences in the <em>X</em>&#8216;s are weighted by the coefficients of from region A and the differences in the coefficients are weighted by the <em>X</em>&#8216;s of from region B.</p>
<p>There are basically three factors that effect health spending in the region: i) differences in health status across regions ii) differences in treatment patterns conditional on health status, and iii) the interaction of health status and conditional treatment effects. One can see this clearly below:</p>
<ul>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = ΔXβ<sup>B</sup> + ΔβX<sup>B</sup> + ΔXΔβ</li>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = H + T + HT</li>
</ul>
<p>The equations above show the health status effect (H), the treatment effect (T) and the interaction (HT).</p>
<p>The specification chosen for the Oaxaca decomposition determines whether the interaction effect is placed with the health status effect or the treatment effect.  More precisely:</p>
<ul>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = ΔXβ<sup>B</sup> + ΔβX<sup>A</sup> = H + (HT + T)</li>
<li>Y<sup>A</sup> &#8211; Y<sup>B</sup> = ΔXβ<sup>A</sup> + ΔβX<sup>B</sup> = (H+ HT) + T</li>
</ul>
<p>In effect, the first decomposition specification incorporates the interaction term with the treatment effect whereas the second specification places the interaction term together with the health status effect.</p>
<p>Sources:</p>
<ul>
<li><small>Oaxaca, R. 1973. “<a href="http://www-bcf.usc.edu/~ridder/Lnotes/Undeconometrics/Transparanten/Wagedecomp.pdf">Male-Female Wage Differentials in Urban Labor Markets</a>.” <em>International Economic Review</em> 14: 693–709.</small></li>
<li><small>Owen O’Donnell, Eddy van Doorslaer, Adam Wagstaff, Magnus Lindelow &#8220;<a href="http://siteresources.worldbank.org/INTPAH/Resources/Publications/459843-1195594469249/HealthEquityCh12.pdf">Explaining Differences between Groups: Oaxaca Decomposition</a>&#8221; Chapter 12, <a href="http://siteresources.worldbank.org/INTPAH/Resources/Publications/459843-1195594469249/HealthEquityFINAL.pdf">Analyzing Health Equity Using Household Survey Data</a>, World Bank, WBI Learning Resources Series, 2008</small>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2012/01/26/add-to-your-skills-toolkit-the-oaxaca-decomposition/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MEPS vs. NHEA</title>
		<link>http://healthcare-economist.com/2011/12/20/meps-vs-nhea/</link>
		<comments>http://healthcare-economist.com/2011/12/20/meps-vs-nhea/#comments</comments>
		<pubDate>Tue, 20 Dec 2011 08:24:04 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[CHIP]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Medicaid]]></category>
		<category><![CDATA[MEPS]]></category>
		<category><![CDATA[National Health Expenditures]]></category>
		<category><![CDATA[NHEA]]></category>
		<category><![CDATA[PHC]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=5873</guid>
		<description><![CDATA[Many researchers use household data sources to examine a variety of hypothesis.  The use of household data has many benefits including allowing for more detailed socioeconomic information (e.g., education, income) beyond what is contained in administrative claims files.  One drawback of household data is that extrapolations made from household survey data may not match national [...]]]></description>
			<content:encoded><![CDATA[<p>Many researchers use household data sources to examine a variety of hypothesis.  The use of household data has many benefits including allowing for more detailed socioeconomic information (e.g., education, income) beyond what is contained in administrative claims files.  One drawback of household data is that extrapolations made from household survey data may not match national estimates.</p>
<p>For instance, <a href="http://www.meps.ahrq.gov/mepsweb/data_files/publications/workingpapers/wp_08006.pdf">this article</a> examines how to align the Medical Expenditure Panel Survey (MEPS) to aggregate U.S. benchmarks provided in the National Health Expenditure Accounts (<a href="https://www.cms.gov/nationalhealthexpenddata/">NHEA</a>).  Today, I review some of these adjustments.</p>
<p><span id="more-5873"></span><br />
The article by <a href="http://www.meps.ahrq.gov/mepsweb/data_files/publications/workingpapers/wp_08006.pdf">Selden and Sing (2008)</a> compares NHEA Personal Health Care (PHC) expenditures against MEPS expenditures. Whereas total NHEA in the U.S. was $1.603 trillion in 2003, PHC was only $1.341 trillion since it excludes administrative costs, public health, research, and construction, none of which are captured by MEPS. Using the PHC as the starting point, here are some other key differences between the MEPS and NHEA PHC measure.</p>
<h3>Matching MEPS to NHEA PHC using MEPS definition of medical expenditures</h3>
<ul>
<li><strong>Institutionalized population and active military</strong>.  PHC includes these individuals, but MEPS excludes these individuals.</li>
<li><strong>Medicaid capitated payments</strong>.  The authors &#8220;&#8230;modified the NHEA allocation of capitated Medicaid payments across types of service using the MEPS expenditure distribution, rather than the fee-for-service Medicaid distribution used in the construction of NHEA. This adjustment shifted expenditures from Medicaid Hospital to Medicaid Physician.&#8221;</li>
<li><strong>Drug Rebates</strong>. NHEA includes rebates public insurance entities receive from pharmaceutical companies.  These rebates are not captured in MEPS.  The authors remove drug rebates from the NHEA PHC estimates.</li>
<li><strong>Medicaid/CHIP underreporting</strong>.  Like most household surveys, the MEPS data contains fewer observations of individuals stating that they have Medicaid or Children’s Health Insurance Program (CHIP) coverage than is the case when these figures are reported in administrative data.  To correct for this underreporting, the authors used &#8220;a 10 percent upweighting of Medicaid and SCHIP recipients, using a raking post-stratification to preserve the MEPS distribution of poverty level, age, sex, Medicare enrollment and uninsurance.&#8221;</li>
<li><strong>Underreporting of high cost cases</strong>.  The authors cite other research (Zuvekas, Cohen and Pylypchuk, 2005; <a href="http://www.inquiryjournalonline.org/doi/abs/10.5034/inquiryjrnl_46.01.92">Zuvekas, Olin, 2009</a>) which claim that MEPS underreports high cost cases.  This could be due in part because severely ill individuals may be have higher attrition rates.  Thus, the authors modify the MEPS sampling weights to increase the prevalence of high-cost cases.  They do this using a partial non-response model. &#8220;Our upweighting strategy targets the top three percent of the expenditure distribution in each of four (hierarchically defined) coverage groups: ever on Medicare, ever on non-Medicare Medicaid and SCHIP, ever on Private, and full-year uninsured. A raking post-stratification was implemented to preserve MEPS distributions by age, sex, race/ethnicity, and poverty level (along with coverage). The average increase in weight was 18.1 percent&#8230;&#8221;</li>
<li><strong>Laboratory Tests</strong>. &#8220;One area in which MEPS is particularly low is separately-billed laboratory tests, the number and financing of which are difficult to ascertain either from household respondents or from follow-back visits to providers ordering the tests.&#8221;  The authors allocated that additional lab spending test to individuals in a proportional fashion based on each individual&#8217;s use of physician services.</li>
<li><strong>Other adjustments</strong>. For the remaining adjustments, the authors simply scaled the MEPS amounts to match the NHEA figures.</li>
</ul>
<h3>Adjusting MEPS to match NHEA definition of medical expenditures</h3>
<p>The list above describes how to calibrate MEPS to match NHEA figures for a MEPS-based definition of medical expenditures. However, the NHEA PHC figures include some expenditures which are not included in MEPS. These costs include:</p>
<ul>
<li><strong>Non-medical assistance with activities of daily living</strong>. These costs are born mostly by Medicaid. The authors allocated these costs in proportion to home health care by source of payment.</li>
<li><strong>Hospital Subsidies</strong>.  These include Medicare and Medicaid disproportionate share (DSH) payments, Medicare graduate medical education subsidies, State and local subsidies to public hospitals, Medicare retrospective adjustments and capital pass-throughs.</li>
<li><strong>Public health, public research, public investment in structures and equipment</strong>.</li>
<li><strong>Administrative Costs</strong>.  For public programs, the authors allocate administrative costs in proportion to spending on care.  For private insurance, the authors use a regression-based model which compares health expenditures against insurance premiums paid by households and employers.</li>
<li><strong>Tax Expenditures</strong>.  Tax expenditures include: i) the exemption of employer health insurance contributions to from employee income and payroll taxes and ii) the exemption of medical care from state and local sales taxes</li>
</ul>
<h3>Comparing MEPS to Medicare claims data</h3>
<p>Not only does MEPS underreport utilization compared the NHEA, similar differences are found when comparing to Medicare claims data. Zuvekas and Olin (2009) find a 19 percent gap between MEPS and Medicare claims data. The The key factors driving these differentials are underreporting of ehalth care utilization by respondents and underrepresentation of high expenditure cases in MEPS.</p>
<p>Specifically, quantities in MEPS come from household responses. Generally, household accurately report inpatient stays, but underreport emergency department and office visits. Further, household generally estimate their out of pocket cost accurately, but most &#8220;&#8230;may not know third-party payments at all or report them inaccurately because of confusion about discounts, adjustments, and contractual arrangements.&#8221;</p>
<p>Underrepresentation of high cost cases about $25,000 is also a problem. Whereas the top decile spending levels in the claims data spend $23,900, in MEPS the figure is $26,700. For the top five percentiles, Medicare claims has average spending of $38,500, compared to $34,600 in MEPS.</p>
<p><!--more--></p>
<p><em>Sources</em>:</p>
<ul>
<li>Thomas Selden, Merrile Sing, &#8220;<a href="http://www.meps.ahrq.gov/mepsweb/data_files/publications/workingpapers/wp_08006.pdf">Aligning the Medical Expenditure Panel Survey to Aggregate U.S. Benchmarks</a>,&#8221; AHRQ Working Paper No. 08006, July 2008.</li>
<li>Samuel H. Zuvekas, Gary L. Olin, &#8220;<a href="http://www.hsr.org/hsr/abstract.jsp?aid=44912215406">Validating Household Reports of Health Care Use in the Medical Expenditure Panel Survey</a>&#8221; Health Services Research, VOLUME 44 | NUMBER 5 | OCTOBER 2009.</li>
<li>Samuel H. Zuvekas, Gary L. Olin, &#8221;<a href="http://www.ncbi.nlm.nih.gov/pubmed/19489486">Accuracy of Medicare Expenditures in the Medical Expenditure Panel Survey</a>&#8221; Inquiry, 209 Spring; 46(1):92-108.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2011/12/20/meps-vs-nhea/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Kappa Statistic</title>
		<link>http://healthcare-economist.com/2011/11/02/kappa-statistic/</link>
		<comments>http://healthcare-economist.com/2011/11/02/kappa-statistic/#comments</comments>
		<pubDate>Wed, 02 Nov 2011 11:40:21 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=5894</guid>
		<description><![CDATA[Many research studies aim to figure out if a physicians did a good job.  Many studies use administrative claims data to evaluate performance.  Other times, researchers use medical record review. One problem with medical record review is that oftentimes experts will come up with differing opinions from reviewing the same medical record.  Thus, researchers often [...]]]></description>
			<content:encoded><![CDATA[<p>Many research studies aim to figure out if a physicians did a good job.  Many studies use administrative claims data to evaluate performance.  Other times, researchers use medical record review.</p>
<p>One problem with medical record review is that oftentimes experts will come up with differing opinions from reviewing the same medical record.  Thus, researchers often have at least two individuals review the medical record so that the results are not biased by a single person&#8217;t opinion.</p>
<p>A question of interest is how reliable are different evaluators of medical record.  Cohen&#8217;s kappa can provide a quantitative estimate of inter-rater reliability.  The formula is the following:</p>
<ul>
<li>[P(a)-P(e)]/[1-P(e)]</li>
</ul>
<div><span class="Apple-style-span" style="line-height: 17px;">Where <em>P(a)</em> is the observed level of agreement and <em>P(e)</em> is the expected level of agreement from pure chance.  </span><span class="Apple-style-span" style="line-height: 17px;">In essence, the kappa measurement compares the observed level of inter-rater agreement against the level of agreement that would be expected by pure chance.  </span></div>
<p>To give an example, consider the situation where two raters rate 10 blogs and can give them a rating of an A, B, or C. These data are available <a href="https://docs.google.com/spreadsheet/ccc?key=0AqBLM3x5sYdBdG9UOWt2UGltNG94aGlTSFYtVUR5MlE">here</a>.  You can see that Tester 1 is more likely to give positive ratings and Tester 2 is more likely to give negative ratings.  In this example, the value of Kappa is 0.44.</p>
<p>A general rule of thumb to follow is values &lt; 0 as indicating no agreement, 0–.20 as slight, .21–.40 as fair, .41–.60 as moderate, .61–.80 as substantial, and .81–1 as almost perfect agreement.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2011/11/02/kappa-statistic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Biases</title>
		<link>http://healthcare-economist.com/2011/06/28/biases/</link>
		<comments>http://healthcare-economist.com/2011/06/28/biases/#comments</comments>
		<pubDate>Tue, 28 Jun 2011 14:05:41 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Bias]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=5483</guid>
		<description><![CDATA[All economists are familiar with the problem of selection bias.  In non-randomized samples, patients may choose to be in either the treatment or control group based on factors which are also related to the outcome of interest.  Even if researchers can design a study that fully controls for selection bias, robust studies must also account [...]]]></description>
			<content:encoded><![CDATA[<p>All economists are familiar with the problem of selection bias.  In non-randomized samples, patients may choose to be in either the treatment or control group based on factors which are also related to the outcome of interest.  Even if researchers can design a study that fully controls for selection bias, robust studies must also account for other biases.  These include:</p>
<ul>
<li><strong>Recall bias</strong>: Patients in one group have better or worse memory of a given event.  If one wishes to compare changes in income for individual who received certain workforce training, individuals who participated in the program may be more or less likely to inflate their income levels over time.</li>
<li><strong>Interviewer bias</strong>: If new data is being collected and researchers use separate interviewers for the treatment and control groups, if one interviewer systematically over/understates the interviewee responses, the study results will be biased.</li>
<li><strong>Observation bias</strong>: This problem is particularly problematic for medical studies.  Observation bias occurs when physicians (or patients) are more likely to detect a disease.  Thus, a study identifying how pollution affected disease rates may underestimate the impact of the pollution if those affected are less likely to detect any disease than those who are not.  For instance, if poor individuals are more likely to drink polluted water than rich individuals, but also less likely to go to the doctor, the disease incidence from polluted water would be underreported and the causal impact of water pollution would be underestimated.</li>
</ul>
<p>Outside of purely statistical biases, the research community at large may suffer from other biases as well.  These include:</p>
<ul>
<li><strong>Funding bias</strong>: Researcher bias towards interpreting quantitative results in favor of the entity which funded their study.</li>
<li><strong>Status quo bias</strong>: Survey respondents may base their opinions closer to the status quo or researchers can interpret their results in a fashion more likely to coincide with the existing academic literature.</li>
<li><strong>Publication Bias</strong>: tendency of researchers, editors, and pharmaceutical companies to handle the reporting of experimental results that are positive (i.e. showing a significant finding) differently from results that are negative (i.e. supporting the null hypothesis) or inconclusive, leading to bias in the overall published literature.</li>
<li><strong>Hindsight bias</strong>: is the inclination to see events that have already occurred as being more predictable than they were before they took place</li>
</ul>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2011/06/28/biases/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Quantile Regressions</title>
		<link>http://healthcare-economist.com/2011/05/10/quantile-regressions/</link>
		<comments>http://healthcare-economist.com/2011/05/10/quantile-regressions/#comments</comments>
		<pubDate>Tue, 10 May 2011 08:01:19 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Percentile]]></category>
		<category><![CDATA[Quantile Regression]]></category>
		<category><![CDATA[Regression]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=5204</guid>
		<description><![CDATA[Understanding quantiles is fairly intuitive. A physician would rank in the τth quantile of in terms of quality of care if he performs better than the proportion τ of the reference group of physicians and worse than the proportion (1–τ). For physicians at the median, half of physicians will perform worse than this doctor and [...]]]></description>
			<content:encoded><![CDATA[<p>Understanding quantiles is fairly intuitive.  A physician would rank in the τth quantile of in terms of quality of care if he performs better than the proportion τ of the reference group of physicians and worse than the proportion (1–τ). For physicians at the  median, half of physicians will perform worse than this doctor and half will perform better.</p>
<p>Quantile regressions, however, offer the power to evaluate whether the predicted effect of selected explanatory variables on the outcome of interest differs depending on the where in the distribution the individual is located.  Koenker and Bassett (1978) created these regression models and based them on the same intuition used to calculate the median. Today I review contrasts how quantile regressions work compared to ordinary least squares (OLS).</p>
<h3>Mean vs. Quantile</h3>
<p>The simplest way to compare OLS against quantile regression is to compare optimization methods for the mean and quantiles (e.g., median).  Most people know the mean and median formulas, but the following specifications detail how to calculate these values for any sample using optimization techniques.</p>
<ul>
<li>Mean: min <sub>μ∈ℜ </sub>Σ (y<sub>i</sub> &#8211; μ)<sup>2</sup></li>
<li>Quantile: min <sub>ξ∈ℜ </sub>Σ ρ<sub>τ</sub>(y<sub>i</sub> &#8211; ξ)</li>
</ul>
<p>where the function ρ<sub>τ</sub>(x) = x(τ &#8211; I(x&lt;0)).  In essence, the function ρ<sub>τ</sub> tilts the absolute value function towards the quantile under investigation.  For the mean, the goal is to pick the a parameter (the mean) which will minimize the sum of squared deviations.  For the quantile, the goal is to pick a parameter which will minimize the sum of absolute deviations.  For the median, the absolute deviations are weighted equally whereas for other quantiles deviations closer the quantile of interest receive more weight than those further away.</p>
<p>I have created <a href="https://spreadsheets.google.com/pub?hl=en&amp;hl=en&amp;key=0AqBLM3x5sYdBdDBtYUdoR25LQmx6NXJVSmVYZzNENEE&amp;output=html">this spreadsheet</a> to more clearly demonstrate how calculating quantiles can be done in practice.  Wikipedia also has a <a href="http://en.wikipedia.org/wiki/Quantile_regression">nice example</a>.</p>
<h3>OLS vs. Quantile Regression</h3>
<p>Again, compare the mechanisms by which OLS and quantile regressions choose the coefficients (i.e., β) to optimize the equations below.</p>
<ul>
<li>OLS: min <sub>β∈ℜ </sub>Σ (y<sub>i</sub> &#8211; Xβ)<sup>2</sup></li>
<li>Quantile Regression: min <sub>β<sub>τ</sub>∈ℜ </sub>Σ ρ<sub>τ</sub>(y<sub>i</sub> &#8211; Xβ<sub>τ</sub>)</li>
</ul>
<p>When you calculate the sample mean, you are calculating the unconditional population mean [i.e., E(y)]. When you conduct the OLS regression, one calculates the <em>conditional </em>expectation function E(y|X)].  Similarly, the quantile regression is used to estimate the <em>conditional</em> quantile of the dependent variable.</p>
<p>To conduct the quantile regression in SAS, on can perform the <a href="http://www2.sas.com/proceedings/sugi30/213-30.pdf">QUANTREG</a> function.  In Stata one can use the <a href="http://www.stata.com/capabilities/qreg.html">qreg</a> function.</p>
<h3>Quantile Regression in Practice</h3>
<p>An example of a paper using Quantile Regression includes the following: Johar, M. and Katayama, H. (2011), <a href="http://onlinelibrary.wiley.com/doi/10.1002/hec.1736/abstract">Quantile regression analysis of body mass and wages</a>. Health Economics, 20: n/a. doi: 10.1002/hec.1736.  This paper uses the National Longitudinal Survey of Youth 1979, to explore the relationship between body mass and wages. The researchers use quantile regression to provide a broad description of the relationship across the wage distribution.  &#8220;Our results find that for female workers body mass and wages are negatively correlated at all points in their wage distribution. The strength of the relationship is larger at higher-wage levels. For male workers, the relationship is relatively constant across wage distribution but heterogeneous across ethnic groups.&#8221;</p>
<p>Sources:</p>
<ul>
<li><small>Koenker, Roger and Gilbert Bassett. 1978. “Regression Quantiles.” Econometrica. January, 46:1, pp. 33–50.</small></li>
<p><small></small></p>
<p><small></small></p>
<p><small></p>
<li>Koenker, Roger and Kevin F. Hallock 2001. “<a href="http://www.econ.uiuc.edu/~roger/research/rq/QRJEP.pdf">Quantile Regression</a>.” Journal of Economic Perspectives. 15:4, pp. 143–156.</li>
<li>Colin (Lin) Chen. &#8220;<a href="http://www2.sas.com/proceedings/sugi30/213-30.pdf">An Introduction to Quantile Regression and the QUANTREG Procedure</a>&#8221; Paper 213-30, SAS Institute, Inc.</li>
<p></small><small></small><small></small><small></small><small></small><small></small></p>
<li><small>Brian S. Cade, Barry R. Noon, (2003) &#8220;<a href="http://www.econ.uiuc.edu/~roger/research/rq/QReco.pdf">A gentle introduction to quantile regression for ecologists</a>&#8220;, Frontiers in Ecology and the Environment, 1 (8), 412–420</small>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2011/05/10/quantile-regressions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Testing precision and accuracy</title>
		<link>http://healthcare-economist.com/2011/05/03/testing-precision-and-accuracy/</link>
		<comments>http://healthcare-economist.com/2011/05/03/testing-precision-and-accuracy/#comments</comments>
		<pubDate>Tue, 03 May 2011 07:04:57 +0000</pubDate>
		<dc:creator>Jason Shafrin</dc:creator>
				<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Predictive Value]]></category>
		<category><![CDATA[Testing]]></category>

		<guid isPermaLink="false">http://healthcare-economist.com/?p=5121</guid>
		<description><![CDATA[How does one determine if a test is accuracy?  What does accuracy mean? One measure of test precision it is the positive predictive value, or the share of positive test results which are actually positive.  Alternatively, the negative predictive value determines the share of negative test results which are true (rather than false) negatives.  Better positive and [...]]]></description>
			<content:encoded><![CDATA[<p>How does one determine if a test is accuracy?  What does accuracy mean? One measure of test precision it is the positive predictive value, or the share of positive test results which are actually positive.  Alternatively, the negative predictive value determines the share of negative test results which are true (rather than false) negatives.  Better positive and negative predictive value indicates a better test.</p>
<p>In addition, sensitivity and specificity uses the gold standard (i.e., &#8220;true&#8221;) results as the denominator.  Sensitivity indicates the share of true positives as a fraction of total people who actually have the condition. Similarly, specificity gives the number of true negatives as a share of the number of test subjects who actually had the disease.</p>
<p>The formulas for these four metrics  describing the accuracy of various diagnostic testing procedures is shown below:</p>
<ul>
<li>
<pre><strong>Positive Predictive Value</strong>:  TP/(TP+FP)</pre>
</li>
<li>
<pre><strong>Negative Predictive Value</strong>:  TN/(TN+FN)</pre>
</li>
<li>
<pre><strong>Sensitivity</strong>:                TP/(TP+FN)</pre>
</li>
<li>
<pre><strong>Specificity</strong>:                TN/(FP+TN)</pre>
</li>
</ul>
<p><a href="https://spreadsheets.google.com/pub?hl=en&amp;hl=en&amp;key=0AqBLM3x5sYdBdHFSMjZUMzNBbTRFU0RGVndTVHNnLWc&amp;output=html">This example</a> below from <a href="http://en.wikipedia.org/wiki/Positive_predictive_value">Wikipedia</a> provides a simple example.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://healthcare-economist.com/2011/05/03/testing-precision-and-accuracy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

