Week 8: Shocks #
Monday, October 28, 2024
7 Days until Presidential Election
Can you believe there is just a week left until election day? Thank you for following along throughout this entire process! This will be my second-to-last post, and next week I will post my final prediction. This week we’ve been learning about October surprises, which refer to unanticipated events that happen in the month before election day that are thought to impact the final results of the race. This week, the Trump campaign held an event at Madison Square Garden where the comedian Tony Hinchcliffe made problematic jokes about Puerto Rico and Latinos, possibly alienating a demographic that has Trump has made headway with this cycle. President Biden made a clumsily-worded response to the event with people construing his words as referring to Trump’s supporters as “ garbage”. I also dealt with my own “October surprise” this week after realizing that my absentee ballot that I applied for two weeks ago likely got lost in the mail and that it was too late to request another to my dorm in Cambridge, MA. After finding a flight back home to Atlanta for $75, I decided it was worth it to cast my ballot in person on election day. Hopefully, everything goes smoothly, and I can participate in the electoral process. Because the impact of shocks are difficult to measure on vote behavior or outcomes, we will use our analysis this week just to update my model and evaluate its results.
Updating Model Predictions #
This week to update model predictions, I rely on the same inputs as the last few weeks: Democrat 2 Party Vote Shares in the past two elections, latest poll averages for Democrats, mean poll averages for Democrats, Consumer Price Index, quarterly GDP growth, and campaign donations. Realistically, the only thing that has changed between past weeks’ models and this week’s is polling data. The most recent polling data I include has results from up to 8 days before the election day on November 5th.
Model Predictions without Regularization #
state | mean_dem | sd_dem | lower_dem | upper_dem | mean_rep | sd_rep | lower_rep | upper_rep |
---|---|---|---|---|---|---|---|---|
Arizona | 52.95022 | 0 | 52.95022 | 52.95022 | 47.04978 | 0 | 47.04978 | 47.04978 |
Georgia | 53.15389 | 0 | 53.15389 | 53.15389 | 46.84611 | 0 | 46.84611 | 46.84611 |
Michigan | 53.80728 | 0 | 53.80728 | 53.80728 | 46.19272 | 0 | 46.19272 | 46.19272 |
Nevada | 54.02149 | 0 | 54.02149 | 54.02149 | 45.97851 | 0 | 45.97851 | 45.97851 |
North Carolina | 53.05170 | 0 | 53.05170 | 53.05170 | 46.94830 | 0 | 46.94830 | 46.94830 |
Pennsylvania | 53.31148 | 0 | 53.31148 | 53.31148 | 46.68852 | 0 | 46.68852 | 46.68852 |
Wisconsin | 53.98853 | 0 | 53.98853 | 53.98853 | 46.01147 | 0 | 46.01147 | 46.01147 |
Using these inputs, I created a simple prediction of Democratic vote share through a linear regression model; I calculated Republican vote share by subtracting Democratic 2-Party vote share from 100%. This resulted in some pretty obviously problematic findings: that Harris will take all battleground states with a clear lead and miniscule standard deviation that doesn’t place the results within the margin of error. This deviates from all credible models and is not something I would trust. So, I will not use these findings for my final forecast.
LASSO Regression Model #
Instead of a simple linear regression, I find it more useful to employ a LASSO Regression model, which will use only those variables with large enough coefficients (or effects on Democrat 2-Party Vote Share) and nullify other small variables. I choose this model because I am incredibly cautious of constructing an overfitted and overly complex model to forecast the election results. Realistically, I am focused on three things: relevant economic indicators, campaign donations, and Democratic performance in polls and past elections.
state | electors | Democrat | Republican | winner |
---|---|---|---|---|
Arizona | 11 | 49.29145 | 50.70855 | Republican |
Georgia | 16 | 49.48063 | 50.51937 | Republican |
Michigan | 15 | 50.18718 | 49.81282 | Democrat |
Nevada | 6 | 50.02578 | 49.97422 | Democrat |
North Carolina | 16 | 49.33849 | 50.66151 | Republican |
Pennsylvania | 19 | 49.86543 | 50.13457 | Republican |
Wisconsin | 10 | 50.34243 | 49.65757 | Democrat |
The results of using LASSO regression are a bit more believable, showing some battleground states going to Harris and most to Trump. The split of battleground states also comports well with what other models are predicting and my general intuition. I would be apprehensive about the exact point estimates of Democrat and Republican vote shares, but I think generally the winner for each state makes sense. Now, let’s take these results, fit them within the context of the whole nation, and see who is predicted to win the electoral college next week.
State | Predicted Electoral Votes | Winner |
---|---|---|
Alabama | 9 | Republican |
Alaska | 3 | Republican |
Arizona | 11 | Republican |
Arkansas | 6 | Republican |
California | 54 | Democrat |
Colorado | 10 | Democrat |
Connecticut | 7 | Democrat |
Delaware | 3 | Democrat |
District Of Columbia | 3 | Democrat |
Florida | 30 | Republican |
Georgia | 16 | Republican |
Hawaii | 4 | Democrat |
Idaho | 4 | Republican |
Illinois | 19 | Democrat |
Indiana | 11 | Republican |
Iowa | 6 | Republican |
Kansas | 6 | Republican |
Kentucky | 8 | Republican |
Louisiana | 8 | Republican |
Maine | 4 | Democrat |
Maryland | 10 | Democrat |
Massachusetts | 11 | Democrat |
Michigan | 15 | Democrat |
Minnesota | 10 | Democrat |
Mississippi | 6 | Republican |
Missouri | 10 | Republican |
Montana | 4 | Republican |
Nebraska | 5 | Republican |
Nevada | 6 | Democrat |
New Hampshire | 4 | Democrat |
New Jersey | 14 | Democrat |
New Mexico | 5 | Democrat |
New York | 28 | Democrat |
North Carolina | 16 | Republican |
North Dakota | 3 | Republican |
Ohio | 17 | Republican |
Oklahoma | 7 | Republican |
Oregon | 8 | Democrat |
Pennsylvania | 19 | Republican |
Rhode Island | 4 | Democrat |
South Carolina | 9 | Republican |
South Dakota | 3 | Republican |
Tennessee | 11 | Republican |
Texas | 40 | Republican |
Utah | 6 | Republican |
Vermont | 3 | Democrat |
Virginia | 13 | Democrat |
Washington | 12 | Democrat |
West Virginia | 4 | Republican |
Wisconsin | 10 | Democrat |
Wyoming | 3 | Republican |
Winner | Electoral Votes |
---|---|
Democrat | 257 |
Republican | 281 |
In previous weeks, I took certain states as “red” states and others as “blue” states based on my intuition and without much justification. The purpose of this was to predict only for battleground states because all other states’ electoral votes were assumed to go to Harris or Trump. I should have defended this assumption, so this week, I am updating my model to use the party winner of the state in 2020 for all states except for the battleground states (instead of just casting certain states as Republican or Democratic givens).
After doing this, I find my model predicts that the Republicans will take the White House in November, having won all the non-battleground states they did in 2020 and taking Arizona, Georgia, North Carolina, and Pennsylvania. This means that, between 2020 and 2024, the Democrats would have lost Arizona, Georgia, and Pennsylvania.
Looking at the map, we can see that the South once again is predicted to vote as a Republican bloc, the Northeast and West Coast remain Democratic strongholds, and the Rust Belt states split down the middle.
Conclusion #
According to this week’s model, Trump will win the 2024 Presidential Election, taking 281 electoral votes.
In comparison to last week’s model, this week presents a more tempered victory for Trump. I made an effort to regularize my model this week through LASSO this week because I want to be as conservative as possible about introducing new variables into my model, and I think that is mainly why this week’s model presents a much much tighter margin. Still, this favors Trump compared to the last time I used LASSO regression because of most recent polling data. In my final prediction, I will continue to rely on this structure and regularize my model with LASSO. It is important to note that the margin of error straddles the win threshold, so these predictions can really go either way to Trump or Harris. Realistically, no one should be surprised by the winner of the election because this race is so close.
Sources #
“‘Supporters” or “supporter’s’? Biden comments about Trump ‘garbage’ rally anger the GOP” CBS News, 2024. https://www.cbsnews.com/news/bidens-response-to-garbage-joke-about-puerto-rico/.
Polling Data Provided by GOV 1347: Election Analytics teaching staff (which drew from the FiveThirtyEight GitHub)
Economic Data Provided by GOV 1347: Election Analytics teaching staff (which drew from the Bureau of Economic Analysis and Federal Reserve Economic Data)