Renglish – Freakonometrics

Channel: Renglish – Freakonometrics

Image may be NSFW.
Clik here to view.

Extracting information from a picture, round 1

December 7, 2018, 6:03 pm

This week, I wanted to get information I found on the nice map, below. I could not get access to the original dataset, per zip code… and I was wondering, if (assuming that the map was with high...

View Article

Image may be NSFW.
Clik here to view.

Extracting information from a picture, round 2

December 8, 2018, 7:44 pm

Yesterday, I published a post on extracting information from a picture, but it did not work as expected. I claimed that it was because of the original graph I had. More precisely, the was based on some...

View Article

Image may be NSFW.
Clik here to view.

Exotic link functions for GLMs

December 10, 2018, 6:48 pm

In my previous post on GLMs, I discussed power link functions. But there are much more links that can be used : The square root link (for the Poisson model) Consider some random variable Y with mean...

View Article

Image may be NSFW.
Clik here to view.

Estimates on training vs. validation samples

May 23, 2019, 1:44 pm

Before moving to cross-validation, it was natural to say “I will burn 50% (say) of my data to train a model, and then use the remaining to fit the model”. For instance, we can use training data for...

View Article

Image may be NSFW.
Clik here to view.

Optimal transport on large networks

July 4, 2019, 7:34 pm

With Alfred Galichon and Lucas Vernet, we recently uploaded a paper entitled optimal transport on large networks on arxiv. This article presents a set of tools for the modeling of a spatial allocation...

View Article

Image may be NSFW.
Clik here to view.

Insurance data science : use and value of unusual data #1

August 5, 2019, 5:55 pm

Next week, with , I will be at the Summer School of the Swiss Association of Actuaries, in Lausanne, with Jean-Philippe Boucher (UQAM) and Ewen Gallic (AMSE). I will give an introductionary talk on...

View Article

Image may be NSFW.
Clik here to view.

Insurance data science : Pictures

August 13, 2019, 2:20 pm

At the Summer School of the Swiss Association of Actuaries, in Lausanne, following the part of Jean-Philippe Boucher (UQAM) on telematic data, I will start talking about pictures this Wednesday....

View Article

Image may be NSFW.
Clik here to view.

Insurance data science : Text

August 14, 2019, 2:25 pm

At the Summer School of the Swiss Association of Actuaries, in Lausanne, I will start talking about text based data and NLP this Thursday. Slides are available online Ewen Gallic (AMSE) will present a...

View Article

Image may be NSFW.
Clik here to view.

Insurance data science : Networks

August 15, 2019, 2:27 pm

At the Summer School of the Swiss Association of Actuaries, in Lausanne, I will start talking about networks and insurance this Friday. Slides are available online

View Article

Image may be NSFW.
Clik here to view.

On leverage

October 3, 2019, 7:14 am

Last week, in our STT5100 (applied linear models) class, I’ve introduce the hat matrix, and the notion of leverage. In a classical regression model, \boldsymbol{y}=\boldsymbol{X}\boldsymbol{\beta} (in...

View Article

Image may be NSFW.
Clik here to view.

Combining automatically factor levels with trees

October 3, 2019, 8:21 am

Last year, in a post, I discussed how to merge levels of factor variables, using combinatorial techniques (it was for my STT5100 cours, and trees are not in the syllabus), with an extension on trees at...

View Article

Image may be NSFW.
Clik here to view.

On the conjugate function

January 13, 2020, 4:36 pm

In the MAT7381 course (graduate course on regression models), we will talk about optimization, and a classical tool is the so-called conjugate. Given a function f:\mathbb{R}^p\to\mathbb{R} its...

View Article

Image may be NSFW.
Clik here to view.

On Cochran Theorem (and Orthogonal Projections)

January 15, 2020, 8:46 am

Cochran Theorem – from The distribution of quadratic forms in a normal system, with applications to the analysis of covariance published in 1934 – is probably the most import one in a regression...

View Article

Image may be NSFW.
Clik here to view.

Quantile Regression (home made, part 2)

February 17, 2020, 12:38 pm

A few months ago, I posted a note with some home made codes for quantile regression… there was something odd on the output, but it was because there was a (small) mathematical problem in my equation....

View Article

Image may be NSFW.
Clik here to view.

Lasso Regression (home made)

February 17, 2020, 5:32 pm

Again, this post is related to my MAT7381 course, where we will see that it is actually possible to write our own code to compute Lasso regression,...

View Article

Image may be NSFW.
Clik here to view.

Testing for a causal effect (with 2 time series)

February 19, 2020, 6:03 pm

A few days ago, I came back on a sentence I found (in a French newspaper), where someone was claiming that “… an old variable explains 85% of the change in a new variable. So we can talk about...

View Article

Image may be NSFW.
Clik here to view.

Function basis and regression

March 1, 2020, 2:30 pm

In the first part of the course on linear models, we’ve seen how to construct a linear model when the vector of covariates \boldsymbol{x} is given, so that \mathbb{E}(Y|\boldsymbol{X}=\boldsymbol{x})...

View Article

Image may be NSFW.
Clik here to view.

Testing for Covid-19 in the U.S.

April 28, 2020, 6:22 pm

For almost a month, on a daily basis, we are working with colleagues (Romuald, Chi and Mathieu) on modeling the dynamics of the recent pandemic. I learn of lot of things discussing with them, but we...

View Article

Image may be NSFW.
Clik here to view.

Regression discontinuity model for TV series

July 12, 2020, 8:24 pm

In September, we are usually happy to see our favorite TV series back on air… Or not? Because, admit it, if we are happy to see those characters back, most of the time, we are disappointed, too. So why...

View Article

Sharing pictures from holidays in the Canadian Rockies (with R)

August 9, 2020, 4:26 pm

My kids have a very popular blog (at least among their grandmothers) where they frequently post pictures from everyday’s life (since they live 5000km from them), as well as pictures taken from...

View Article

Hidding values in the output of the summary function for a (linear) regression

August 12, 2020, 7:55 am

Since our Fall 2020 session will be 100% online (and off-site), I have to work hard this summer to prepare online quizz and exams. I started intensively to play with Achim’s awesome r-exams package....

View Article

Image may be NSFW.
Clik here to view.

R0 and the exponential growth of a pandemic

August 16, 2020, 11:51 am

For some dissemination work, I want to create a nice graph to explain the exponential growth in pandemics, related to the value of R_0. Recall that R_0 corresponds to the average number of people that...

View Article

Image may be NSFW.
Clik here to view.

R0 and the exponential growth of a pandemic, an update

August 18, 2020, 7:19 am

A few days ago, I wrote a blog post – R0 and the exponential growth of a pandemic – where I was trying to generate some visualization of some exponential growth, in the context of a pandemic. After...

View Article

Image may be NSFW.
Clik here to view.

Trees and forests

November 30, 2020, 1:51 pm

For my ACT6100 weekly quiz, I usually generate some datasets, and then ask students to compare various predictive algorithms. Last week, it was about classification trees and random forests. And...

View Article

Image may be NSFW.
Clik here to view.

Insurance Pricing Game

December 18, 2020, 2:44 am

Would you like to put your data science skills to the test? Imperial College London, Universite du Quebec à Montreal (UQAM), and actuarial institutes in Singapore, the UK, including the IFoA, and...

View Article

Image may be NSFW.
Clik here to view.

Lilliefors, Kolmogorov-Smirnov and cross-validation

January 5, 2021, 7:58 am

In statistics, Kolmogorov–Smirnov test is a popular procedure to test, from a sample \{x_1,\cdots,x_n\} is drawn from a distribution F, or usually F_{\theta_0}, where F_{\theta} is some parametric...

View Article

Image may be NSFW.
Clik here to view.

Some general thoughts on Partial Dependence Plots with correlated covariates

February 12, 2021, 1:55 pm

The partial dependence plot is a nice tool to analyse the impact of some explanatory variables when using nonlinear models, such as a random forest, or some gradient boosting.The idea (in dimension 2),...

View Article

Image may be NSFW.
Clik here to view.

From multinomial regression to binary classification on some Siamese data

March 14, 2021, 2:54 pm

There are two kinds of people in the world: people who think there are two kinds of people in the world and people who don’t (borrowed from Menand (2018)). Because things are always simpler when we...

View Article

Image may be NSFW.
Clik here to view.

Could there be incentives to cycle through a red light?

August 13, 2021, 10:09 pm

This is of course a rhetorical question! Because cyclists must stop when the light is red! … But … there is always that moment, on a bicycle, when you stop, and then you say to yourself the worst part...

View Article

Image may be NSFW.
Clik here to view.

Snow in Montréal (Canada)

January 29, 2023, 7:08 am

Winter started a bit more than one month ago… but we have already experienced many snow storms… there is still a lot snow in gardens and in the streets, I was wondering if it was that unusual, but...

View Article

More Pages to Explore .....

Latest Images