twitter.com - HarryDCrane
Sports betting industry is focused on providing a profitable environment for operators so states can collect revenue. There is no place for winning bettors. They are unwanted and unwelcome.
statsbomb.com - Will Morgan
As a follower of multiple sports, it's an interesting exercise to consider what broad trends can be observed from sport-to-sport. One such shift has been the upending of the traditional roles of players and how they are deployed strategically, which have subverted how "the game is supposed to be played".
sfu.ca - Lucas Wu and Tim B. Swartz
With the availability of tracking data, the determination of pitch control (field ownership) is an increasingly important topic in sports analytics. This paper reviews various approaches for the determination of pitch control and introduces a new field ownership metric that takes into account associated sporting dynamics. The methods that are proposed utilize the movement of the ball and players. Specifically, physical characteristics such as current velocity, acceleration and maximum velocity are considered. The determination of pitch control is based on the time that it takes the ball and the players to reach a given location. The main result of our investigation concerns the validation of the resultant pitch control diagram. Based on a sample of 5887 passes, the team identified as having pitch control was the observed recipient of the pass with 91% accuracy. The approach is generally applicable to invasion sports and is illustrated in the context of soccer. Various parameters are introduced that allow a user to modify the methods to alternative sports and to introduce player-specific maximum velocities and player-specific accelerations.
statsbomb.com - StatsBomb
StatsBomb is celebrating a 10th anniversary this summer: a decade since the forming of the website to share and host work from the analytics community. The foundation of the website and the business are community-based, and we've always been keen to pay it back.
So this summer, we're going to release the 2015/16 Big 5 League seasons, on our industry-leading data spec, for free.
1,826 matches, 98 teams, ~2,500 players, and ~6,000,000 rows of event data to work with.
youtube.com - Michael Struwig
This week's reading group video focuses on the paper entitled 'BloombergGPT: A Large Language Model for Finance'. The paper has attracted considerable attention, and we are thrilled to explore it further with everyone on Friday.
We will discuss various aspects of the model, including its architecture and origins, the distinctive dataset used to train it, its assessment for financial tasks, and the extent to which it lives up to the media's hype regarding its relevance and groundbreaking nature.
twitter.com - Christina Qi
If you want to learn about the inner workings of a top HFT firm, read this case where KCG is suing a former employee.
github.io - Lilian Weng
Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond generating well-written copies, stories, essays and programs; it can be framed as a powerful general problem solver.
github.io - Percy Liang
Welcome to CS324! This is a new course on understanding and developing large language models.
pythonspeed.com - Itamar Turner-Trauring
If you’re doing numeric calculations, NumPy is a lot faster than than plain Python—but sometimes that’s not enough. What should you do when your NumPy-based code is too slow?
blogspot.com
Aren't ALL games of "uncertain outcome"? And what does a 'small streak without drawing' mean? Basically nothing, except that a pattern has been found that has no predictive value at all. I guess a small fortune was lost this year on Tottenham Hotspur, chasing that overdue Draw!
wordpress.com
Today I’m going to tackle a topic that is somewhat controversial. Myths, misconceptions and disinformation are everywhere, there is passionate debate from all directions, and there is a lot of complex stuff going on. Let’s talk about the different ways that a centralized* sportsbook can be run; specifically, how they set their odds and how they manage risk.
sfu.ca - Dani Chu, Yifan Wu and Tim B. Swartz
This paper considers an extension of the Kelly criterion used in sports wagering. By recognizing that the probability p of placing a correct wager is unknown, modified Kelly criteria are obtained that take the uncertainty into account. Estimators are proposed that are developed from a decision theoretic framework. We observe that the resultant betting fractions can differ markedly based on the choice of loss function. In the cases that we study, the modified Kelly fractions are smaller than original Kelly. Journal of Quantitative Analysis in Sports, 14, 1-11.
bloomberg.com - ByKit Chellel and Jeremy Hodges
A secretive hedge fund used the British court system to punish an IP thief‚ even though he was already in jail.
statsbomb.com - Colin Trainor
In an article published last October I took my first look at some defensive metrics. That piece was very much an introductory one as I offered up a few of my initial ideas for consideration. I’m now going to take the opportunity to expand on one of the ideas that I wrote about in that initial article, Passes Allowed Per Defensive Action (PPDA).
arxiv.org - Leszek Szczecinski and Raphaelle Tihon
In this work, we deal with the problem of rating in sports, where the skills of the players/teams are inferred from the observed outcomes of the games. Our focus is on the online rating algorithms which estimate the skills after each new game by exploiting the probabilistic models of the relationship between the skills and the game outcome. We propose a Bayesian approach which may be seen as an approximate Kalman filter and which is generic in the sense that it can be used with any skills-outcome model and can be applied in the individual- as well as in the group-sports. We show how the well-know algorithms (such as the Elo, the Glicko, and the TrueSkill algorithms) may be seen as instances of the one-fits-all approach we propose. In order to clarify the conditions under which the gains of the Bayesian approach over the simpler solutions can actually materialize, we critically compare the known and the new algorithms by means of numerical examples using the synthetic as well as the empirical data.
realpython.com - Christopher Bailey, Christopher Trudeau
We cover a recent post by previous guest Matt Harrison about using Python and pandas for finance. Matt’s article covers methods in the pandas library for aggregation, resampling, and rolling averages.