predictive model

I have several pieces of data available, but I'm not sure about the best use of it in a predictive model. I envision using a regression, then plugging the predicted values into a monte carlo simulation for optimization. I'm wondering how an experienced statistician would use the available data...

I want to know your mind about one point related to logit regression model. I am using logit model for my analyzing. My key independent variable is the lagged; hence, it is predictive regression. So, my dependent variable is 0 and 1. These 0 and 1 is related to the time of the day. To make it...

i made a random forest predictive model and want my users to input the independent variables and predict the dependent variable. i'm using shiny to implement this.
i am getting the following error as the reactive values are stored as character inputs where as my predictive model is built with...

Hello dear forum members!
Currently, I am working on a project that aims to predict a certain cancer-related outcome (y) using a number of control (c) and predictor (X) variables:
y(i) = a + c(it) + X(it) + u (1)
In Equation (1): y(i) is continuous in nature, data is available only...

Hi Everyone,
My question is how do I calculate weights for Y=W1X1*W2X2*WNXN where N is the number of independent variables.
I could use logarithms to convert this to an additive model, but each of the independent variables are binary/dummy variables. Here is some background.
Assume...

Hi - I am a HR professional looking to self learn statistical modeling for new responsibilities at work..Need to forecast no. of employees who may retire next 10 years. What would be simple way to forecast for this? I have historical retirements data broken by age groups (50-55 etc).
Have...

Hello,
I apologize if I am posting in the incorrect place. If so, could someone please direct me or move this thread? Or if something similar has been asked can someone link me to the thread?
I am an accountant/finance guy and my knowledge of statistics and quant analysis is limited to...

I am Achyut Sarma B, a final year student pursuing bachelors in Computer Sciences from Amrita University, Coimbatore, India. I am new to data sciences but am being fascinated by it and it's ways to solve real world business problems. Here are some problems that are really annoying me since the...

Hi Stats Gurus,
I am working on a model that predicts gestation period of new hires. By gestation period, i mean the period between they completed their trainings and the time they start working on their first real live project. I have data for gestation period in months. I classified the...

I have a two-class classification problem. I would like to train a multivariate classifier with 100% positive predictive value. In other words, I want the model to completely avoid one of the classes. For this application a low-ish sensitivity is OK as long as PPV is ~100%.
Do you have any...

I am trying to use LASSO for variable selection, on a balanced panel. I have a total of 14 predictors, and would like to reduce this variable space.
The panel is comprised of n dependent variables, each having t observations. I am planning to run LASSO on the cross-section for each time t...

I am producing a logistical regression analysis to find a model to predict whether someone has minor mental health problems or not.
I'm almost certain i've coded all my data correctly but no matter which variable I put into the model it doesn't predict any yes outcomes. my classification table...

I have binomial frequency data for an allele associated with populations living in mountainous. These mountains run north to south where sites are nearly fixed for this allele and lowland sites to the east that lack the allele. At the south end of the mountains is a hybrid zone. Despite the...

Suppose I have built a model to predict default on loans.
In my modeling data set I have three independent variables: income, home(own or rent) and job(manager, sales, self, other). They are all very strong predictors.
For some reason, I won't be able to have the variable "job" in my scoring...

I have a real world business problem of selecting products to recommend to our contacts in an email campaign.
The data set I have contains the click thrus of our products' ads on various websites and other info about the visitors of the websites that have the ads on them.
Some of the variables...