My completely uninformed guide to March Madness and some thoughts on my kaggle entry

I submitted my March Madness Machine Learning Mania today. My two entries consist of picks made using the actual spread for first round games and then a simple Bradley-Terry model for the games past that. In my second bracket, which I called the “aggressive” one, I picks UConn in the men’s bracket and South Carolina in the women’s bracket to win each round (and thus the tournament) with probability 1. So is UConn and South Carolina win, maybe I have a shot at winning. I also manually adjusted two teams on the women’s side (South Carolina and USC). The problem with South Carolina is that they haven’t lost any games and Bradley-Terry basically can’t handle that. I adjusted their regression coefficients to match the market price that they win the tournament. I also adjusted USC on the women’s side because they were way off their market price too. I didn’t make any adjustments to the men’s side because the futures prices were generally in the ball park with what I was estimating (i.e. there are three truly top teams in the men’s bracket (Purdue, UConn, Houston), then a sizable gap down to the next team (which I think is Iowa State. North Carolina got a gift of a 1 seed.)

I’m really excited about the new scoring system for Kaggle this year. And I think they got the scoring system right. A few weeks ago I believe I read that the scoring system was going to be average bracket score with traditional bracket scoring (1-2-4-8-16-32). My first thought when I saw this was that the best strategy is to just enter one bracket and hope. I think other people figured this out and they change it to a Brier score metric. But what they really got right this year is that that don’t take the average over all the GAMES, they take the average over the 6 ROUNDS. This weights the game in the finals much more heavily than a game in the first round, much closer to traditional bracket games.

Anyway, here are some probabilities below are based on my 10000 brackets that I submitted:

To win the championship:

UConn – 29.3%

Houston – 20.9%

Purdue – 19.4%

Iowa St – 5.19%

North Carolina – 4.96%

Tennessee – 3.39%

Marquette – 2.49%

Auburn – 1.94%

Illinois – 1.51%

Baylor – 1.43%

Everyone else < 1%

To make the finals:

UConn – 45.5%

Houston – 34.8%

Purdue – 32.4%

North Carolina – 12.6%

Iowa St – 11.4%

Tennessee – 7..92%

Marquette – 6.51%

Auburn – 5.8%

Illinois – 4.54%

Baylor – 4.34%

Arizona – 3.16%

Alabama – 2.68%

South Carolina – 2.46%

Kansas – 2.31%

Kentucky – 2.13%

Creighton 2.03%

Everyone else < 2%

And finally, here are my pre-tournament rankings 1 through UMass:

Rank
TeamName
1Connecticut
2Houston
3Purdue
4Iowa St
5North Carolina
6Tennessee
7Auburn
8Marquette
9Illinois
10South Carolina
11Baylor
12Kansas
13Utah St
14Creighton
15Arizona
16Duke
17Nevada
18Kentucky
19Alabama
20San Diego St
21BYU
22Florida
23Texas Tech
24New Mexico
25Wisconsin
26Gonzaga
27Nebraska
28Colorado St
29Dayton
30Clemson
31Virginia
32Mississippi St
33Boise St
34Texas
35St Mary’s CA
36TCU
37Drake
38Oklahoma
39Grand Canyon
40Northwestern
41Colorado
42Texas A&M
43Washington St
44Indiana St
45Pittsburgh
46NC State
47FL Atlantic
48Syracuse
49Providence
50Michigan St
51Oregon
52St John’s
53James Madison
54Seton Hall
55Mississippi
56Kansas St
57Ohio St
58Indiana
59Princeton
60Wake Forest
61Cincinnati
62Iowa
63Butler
64Virginia Tech
65Villanova
66Samford
67Richmond
68Duquesne
69McNeese St
70Utah
71Memphis
72Loyola-Chicago
73South Florida
74UNLV
75Florida St
76Boston College
77Xavier
78UCF
79LSU
80Georgia
81VCU
82Minnesota
83UAB
84Washington
85Bradley
86San Francisco
87Appalachian St
88Arkansas
89Rutgers
90Cornell
91Vermont
92Miami FL
93Col Charleston
94Maryland
95Penn St
96Georgia Tech
97St Joseph’s PA
98Yale
99USC
100UC Irvine
101St Bonaventure
102Santa Clara
103George Mason
104Massachusetts
Cheers.

Posted on March 21, 2024, in Uncategorized and tagged , , , , . Bookmark the permalink. Leave a comment.

Leave a comment