Comparison Between Logistic Regression and Random Forest for Determining Factors of Intimate Partner Violence in Peru

Authors

  • Ashley Mercedes Guerrero-Muguerza

DOI:

https://doi.org/10.26439/ciis2019.5518

Abstract

Intimate  partner  violence  is  a  social  problem  that  has  been  studied  by  different  researchers  to  determine  the  factors  that  influence  its  occurrence,  considering  different  environments, moments and locations. Sixty-eight point two percent (68.2%) of women have been victims of violence and 31.7% have been victims of physical aggression in Peru. The present research proposes  nine  models  based  on  logistic  regression  and  random  forest  with  variants  such  as   chi-square, Entropy and Gini, and three subscenarios out of five, ten and twenty variables that used the dataset of complaints registered in 2016 at the Ministry of Women. The best result of each subscenario was obtained, but finally the best model was that of twenty variables which used the random forest “feature selection” (Entropy) and the random forest model (Gini).

Downloads

Download data is not yet available.

Downloads

Published

2020-07-15

How to Cite

Comparison Between Logistic Regression and Random Forest for Determining Factors of Intimate Partner Violence in Peru. (2020). Actas Del Congreso Internacional De Ingeniería De Sistemas, 252-253. https://doi.org/10.26439/ciis2019.5518