Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

About me

Posts

CV of failures

2 minute read

Published:

“The only real mistake is the one from which we learn nothing.”
- Henry Ford

Binomial in the Soccer field

3 minute read

Published:

“To be or not to be ..nomial”
- Statistical bad joke based on William Shakespeare, by me.

First post!

less than 1 minute read

Published:

“I have a dream.”
- Martin Luther King

portfolio

projects

Bike Sharing Machine Learning Model

Published:

Demand forecasting is an important aspect for many companies in carrying out their operations. In this project we analyze the bike-sharing dataset from UCI Machine Learning Repository, and build a regression learning machine model using the Random Forest Regressor algorithm to predict the count of bike rentals based on time and weather-related information. For this project we use Docker, making it easier to replicate the analysis.
$\bigstar$ Click here to look at the original GitHub repository of this project.

Feature selection - R and Python packages

Published:

If you’ve ever encountered a dataset with a myriad number of features, you know it could be very difficult to work with all of them. Some features may not even be important or relevant and could even cause optimization bias. One approach to this problem is to select a subset of these features for your model. Feature selection will reduce both complexity and time training an algorithm, as well as improve the accuracy of your model – if we select them wisely. However, this is not a trivial task and to that end we have created two packages addressing the feature-selection, in Python and R programming languages.
$\bigstar$ In the next links you can access to the original feature-selection sites: package for Python and package for R.

Guide and tips to use SIE API

Published:

Banco de Mexico publishes the archive of national economic databases through the Economic Information System, known as SIE. Also, this Central Bank has developed an API that allows Developers, Analysts and Researchers to consult automatically these time series. To complement this effort, as a personal project, I prepared a guide to disclosure the SIE API, explaining how to retrieve information using the siebanxicor R package developed by Banco de Mexico, as well as a custom function to explore the selected series and a Dashboard to look into currency time series.
$\bigstar$ Click here to look the GitHub repository with the complete guide, tips and dashboard to use the SIE API with R.

Guide to use Haver Analytics

Published:

Haver Analytics is a company that collects Economic, Financial and Monetary time series in real time from international and official primary sources as Central Banks and Governments. Some institutions, as Banco de Mexico, have a subscription to consult the Haver Analytic’s databases. The aim of this project is to share a Guide to use Haver Analytics, as R bookdown, which initial objective is helping my colleagues from Banco de Mexico to retrieve information from these databases using R, Python and tools developed by Haver.
$\bigstar$ Click here to see the Guide to use Haver Analytics.

Canada response to COVID-19

Published:

The idea of this project was born on March 11th of 2020, when the Prime Minister of Canada, Justin Trudeau, announced a series of policies to help Canadians cope with the COVID-19. The objective is to measure the impact of the Government policies to help Canadians cope with the COVID-19, studying people’s perception by making sentiment analysis on users’ tweets, before and after the Prime Minister’s announcement.
$\bigstar$ Click here to look at the original GitHub repository of this project.

Choosing a baby name using Python

Published:

Getting a name for a baby is not as trivial as people could think, or at least not for my wife and me. We look for names for our baby girl everywhere and we pick some options. However, I started to wonder if we were looking in the right places and if there could be a way to measure which would be the best name for our daughter. So, I found three databases with names from Spanish and English-speaking countries, analyzed trends and top frequent names, and wrote down a list of all possible names. Finally, I developed an tool that transforms the names using the International Phonetic Alphabet and measures how well a first name sounds with a middle name and/or family names in both English and Spanish, by returning a score to make it easier finding possible names for a baby.
$\bigstar$ Click here o see the GitHub repository with the complete analysis of this project.

Project development course

Published:

In January 2022, I start teaching the Project Development course of the Master in Data Science at UDG. I took this as a Medium-term personal project and, after two years of inherit this course, I selected different data science material and tools for the students to learn and use it for future projects and, in some cases, for their thesis. These material includes collaborative yhe use of version control software GitHub, an overview of Markdown and User Interface commands, perform EDA and use tools for web scraping, interactive graphics, text analysis, wordclouds and maps with Python.
$\bigstar$ Click here o see the GitHub public repository with the material of the course.

publications

Portfolio Investments with Scenarios using Heuristics.

Published in Bibliotecas UDLAP, 2006

This project develops a Scenarios Model for Investment Portfolios, where Genetic Algorithms were used to find the Return and Risk of the model. Warning: original document in Spanish.

Recommended citation: Cuspinera Contreras, V.H. (2006). "Carteras de Inversión con Escenarios Aplicando Heurísticas." Tesis Licenciatura. Actuaría. Departamento de Actuaría y Matemáticas, Escuela de Ingeniería y Ciencias, Universidad de las Américas Puebla. http://catarina.udlap.mx/u_dl_a/tales/documentos/lat/cuspinera_c_vh/

Caste and Religion-Based Wage Discrimination in the Indian Private Sector: Evidence from the Indian Human Development Survey

Published in The Review of Black Political Economy, 2016

Using data from the IHDS, we examine evidence of caste and religion-based discrimination in the Indian private sector compared with the public sector for both Dalits and Adivasis. This is result of affirmative action policies in the government institutions, and arise the question if similar affirmative action should be implemented in the private sector?

Recommended citation: Axmann, N., Swanson, K. & Cuspinera-Contreras, V. (2016). "Caste and Religion-Based Wage Discrimination in the Indian Private Sector: Evidence from the Indian Human Development Survey." Rev Black Polit Econ, Volume 43, issue 2, pages 165-175. https://doi.org/10.1007/s12114-016-9235-8

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.