国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看

合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

代寫MS6711、代做Python語言程序
代寫MS6711、代做Python語言程序

時間:2025-03-07  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



MS6711 Data Mining
Homework 2
Instruction
This homework contains both coding and non-coding questions. Please submit two files,
1. One word or pdf document of answers and plots of ALL questions without coding details.
2. One jupyter notebook of your codes.
3. Questions 1 and 2 are about concepts, 3 - 6 are about coding.
1
Problem 1 [20 points]
We perform best subset, forward stepwise and backward stepwise selection on the same dataset with p
predictors. For each approach, we obtain p + 1 models containing 0, 1, 2, · · · , p predictors. Explain your
answer.
1. Which of the three models with same number of k predictors has smallest training RSS?
2. Which of the three models with same number of k predictors has smallest testing RSS? (best
subset, forward, backward, or cannot determine?)
3. True or False: The predictors in the k-variable model identified by forward stepwise are a subset of
the predictors in the (k + 1)-variable model identified by forward stepwise selection.
4. True or False: The predictors in the k-variable model identified by best subset are a subset of the
predictors in the (k + 1)-variable model identified by best subset selection.
5. True or False: The lasso, relative to OLS, is less flexible and hence will give improved prediction
accuracy when its increase in bias is less than its decrease in variance.
2
Problem 2 [20 points]
Suppose we estimate Lasso by minimizing
||Y − Xβ||2
2 + λ||β||1
for a particular value of λ. For part 1 to 5, indicate which of (a) to (e) is correct and explain your answer.
1. As we increase λ from 0, the training RSS will
(a) Increase initially, and then eventually start decreasing in an inverted U shape.
(b) Decrease initially, and then eventually start increasing in a U shape.
(c) Steadily increase.
(d) Steadily decrease.
(e) Remain constant.
2. Repeat 1. for test RSS.
3. Repeat 1. for variance.
4. Repeat 1. for (squared) bias.
3
Problem 3 [20 points]
These data record the level of atmospheric ozone concentration from eight daily meteorological mea surements made in the Los Angeles basin in 1976. We have the 330 complete cases1. We want to find
climate/weather factors that impact ozone readings. Ozone is a hazardous byproduct of burning fossil
fuels and can harm lung function. The data set for this problem is:
Variable name Definition
ozone Long Maximum Ozone
vh Vandenberg 500 mb Height
wind Wind speed (mph)
humidity Humidity (%)
temp Sandburg AFB Temperature
ibh Inversion Base Height
dpg Daggot Pressure Gradint
ibt Inversion Base Temperature
vis Visibility (miles)
doy Day of the Year
[Note: I would recommend you use R for this question, since python does not have package for
forward / backward selection. See the code example on Canvas. Or you may use the sample python code
I provided.]
1. Report result of linear regression using all variables. Note that ozone is the response variable to
predict. What variables are significant?
2. Report the selected variables using the following model selection approaches.
(a) All subset selection.
(b) Forward stepwise
(c) Backward stepwise
3. Compare the outcome of these methods with the significant variables found in the full linear regres sion in question 1.
4. Potentially, other transformation of covariates might be important. What happens if you do all
subset selection using both the original variables and their square? That is, for all variables, include
4
both
X, X2
in the linear regression model for all subset selection.
5
Problem 4 [20 points]
In this exercise, we will predict the number of applications received using the other variables in the College
data set.
Private Public/private school indicator
Apps Number of applications received
Accept Number of applicants accepted
Enroll Number of new students enrolled
Top10perc New students from top 10% of high school class
Top25perc 1 = New students from top 25 % of high school class
F.Undergrad Number of full-time undergraduates
P.Undergrad Number of part-time undergraduates
Outstate Out-of-state tuition
Room.Board Room and board costs
Books Estimated book costs
Personal Estimated personal spending
PhD Percent of faculty with Ph.D.
Terminal Percent of faculty with terminal degree
S.F.Ratio Student faculty ratio
perc.alumni Percent of alumni who donate
Expend Instructional expenditure per student
Grad.Rate Graduation rate
1. Split the data set into a training set and a test set.
2. Fit a linear regression model using OLS on the training set, and report the test error obtained.
3. Fit a ridge regression model on the training set, with λ chosen by cross-validation. Report the test
error obtained.
4. Fit a lasso model on the training set, with λ chosen by cross-validation. Report the test error
obtained, along with the number of non-zero coefficient estimates.
5. Fit a PCR model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of M selected by cross-validation.
6. Fit a PLS model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of number of components selected by cross-validation.
6
Problem 5 [20 points]
We will now try to predict per capita crime rate in the Boston data set.
crim per capita crime rate by town.
zn proportion of residential land zoned for lots over 25,000 sq.ft.
indus proportion of non-retail business acres per town.
chas Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox nitrogen oxides concentration (parts per 10 million).
rm 1 = average number of rooms per dwelling.
age proportion of owner-occupied units built prior to 1940.
dis weighted mean of distances to five Boston employment centres.
rad index of accessibility to radial highways.
tax full-value property-tax rate per $10,000.
ptratio pupil-teacher ratio by town.
black 1000(Bk − 0.63)2 where Bk is the proportion of blacks by town.
lstat lower status of the population (percent).
medv median value of owner-occupied homes in $1000s.
1. Try out some of the regression methods explored in this chapter, such as best subset selection, the
lasso, ridge regression, PCR and partial least squares. Present and discuss results for the approaches
that you consider.
2. Propose a model (or set of models) that seem to perform well on this data set, and justify your
answer. Make sure that you are evaluating model performance using validation set error, cross validation, or some other reasonable alternative, as opposed to using training error.
3. Does your chosen model involve all of the features in the data set? Why or why not?
7
Problem 6 [20 points]
In a bike sharing system the process of obtaining membership, rental, and bike return is automated
via a network of kiosk locations throughout a city. In this problem, you will try to combine historical
usage patterns with weather data to forecast bike rental demand in the Capital Bikeshare program in
Washington, D.C.
You are provided hourly rental data collected from the Capital Bikeshare system spanning two years.
The file Bike train.csv, as the training set, contains data for the first 19 days of each month, while
Bike test.csv, as the test set, contains data from the 20th to the end of the month. The dataset includes
the following information:
daylabel day number ranging from 1 to 731
year, month, day, hour hourly date
season 1=spring,2=summer,3=fall,4=winter
holiday whether the day is considered a holiday
workingday whether the day is neither a weekend nor a holiday
weather 1 = clear, few clouds, partly cloudy
2 = mist + cloudy, mist + broken clouds, mist + few clouds, mist
3 = light snow, light rain + thunderstorm + scattered clouds, light rain
4 = 4 = heavy rain + ice pallets + thunderstorm + mist, snow + fog
temp temperature in Celsius
atemp ’feels like’ temperature in Celsius
humidity relative humidity
wind speed wind speed
count number of total rentals, outcome variable to predict
Predictions will be evaluated using the root mean squared error (RMSE), calculated as
RMSE =
v
u
u t
n
1
nX
i=1
(yi − ybi)
2
where yi
is the true count, ybi
is the prediction, and n is the number of entries to be evaluated.
Build a model on train dataset to predict the bikeshare counts for the hours recorded in the test
dataset. Report your prediction RMSE on testing set.
Some tips
• This is a relatively open question, you may use any model you learnt from this class.
8
• It will be helpful to examine the data graphically to spot any seasonal pattern or temporal trend.
• There is one day in the training data with weird atemp record and another day with abnormal
humidity. Find those rows and think about what you want to do with them. Is there anything
unusual in the test data?
• It might be helpful to transform the count to log(count + 1). If you did that, do not forget to
transform your predicted values back to count.
• Think about how you would include each predictor into the model, as continuous or as categorical?
• Is there any transformation of the predictors or interactions between them that you think might be
helpful?
Try to summarize your exploration of the data, and modeling process. You may fit a few models and
chose one from them. You will receive points based on your write-up and test RMSE. This is not a
competition among the class to achieve the minimal RMSE, but your result should be in a reasonable
range.


請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp



 

掃一掃在手機打開當前頁
  • 上一篇:INT5051代做、代寫Python編程設計
  • 下一篇:代寫COMP3334、代做C/C++,Python編程
  • 無相關信息
    合肥生活資訊

    合肥圖文信息
    流體仿真外包多少錢_專業CFD分析代做_友商科技CAE仿真
    流體仿真外包多少錢_專業CFD分析代做_友商科
    CAE仿真分析代做公司 CFD流體仿真服務 管路流場仿真外包
    CAE仿真分析代做公司 CFD流體仿真服務 管路
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真技術服務
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲勞振動
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲
    流體cfd仿真分析服務 7類仿真分析代做服務40個行業
    流體cfd仿真分析服務 7類仿真分析代做服務4
    超全面的拼多多電商運營技巧,多多開團助手,多多出評軟件徽y1698861
    超全面的拼多多電商運營技巧,多多開團助手
    CAE有限元仿真分析團隊,2026仿真代做咨詢服務平臺
    CAE有限元仿真分析團隊,2026仿真代做咨詢服
    釘釘簽到打卡位置修改神器,2026怎么修改定位在范圍內
    釘釘簽到打卡位置修改神器,2026怎么修改定
  • 短信驗證碼 寵物飼養 十大衛浴品牌排行 suno 豆包網頁版入口 wps 目錄網 排行網

    關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
    ICP備06013414號-3 公安備 42010502001045

    国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看
    男人天堂成人在线| 久久视频中文字幕| 国产精品成人av性教育| 欧美精彩一区二区三区| 久久精品一区中文字幕| 国内精品视频免费| 精品伦精品一区二区三区视频| 国产色综合一区二区三区| 一区二区三区不卡在线| 国产精品69久久| 欧美专区中文字幕| 欧美精品少妇videofree| 成人做爽爽免费视频| 亚洲精品成人自拍| 日韩中文字幕在线播放| 国产一区二区中文字幕免费看| 一区二区视频在线观看| 7777奇米亚洲综合久久| 日本视频一区在线观看| 国产成人精品在线| 国产一级特黄a大片99| 亚洲色成人一区二区三区小说| 国产成人一区二区三区| 黄色特一级视频| 亚洲综合色av| 深夜福利日韩在线看| 国产在线资源一区| 日韩一级片一区二区| 国产精品乱码视频| 成人国内精品久久久久一区| 日韩高清国产精品| 欧美wwwxxxx| 久久久久久久久网站| 国产综合 伊人色| 日本一区二区在线视频| 国产精品成人播放| 久久久久se| 国产欧美日韩精品在线观看| 日韩.欧美.亚洲| 精品久久久久久久免费人妻| 久久久综合免费视频| 国产一区二区高清不卡| 日本三级中文字幕在线观看| 欧美猛少妇色xxxxx| 色婷婷av一区二区三区在线观看| 国产日韩精品在线| 日韩偷拍一区二区| 亚洲综合在线做性| 国产精品区一区二区三含羞草 | 日产国产精品精品a∨| 国产精品福利网站| 久久国产精品视频在线观看| 成人免费在线小视频| 国语对白做受xxxxx在线中国| 日本一区二区三不卡| 在线观看一区二区三区三州| 国产精品久久久久久av福利| 久久精品五月婷婷| 成人国产在线看| 国产人妻人伦精品| 国语自产精品视频在线看| 日韩精品第一页| 无码少妇一区二区三区芒果| 欧美日本在线视频中文字字幕| 久久久国产视频| 国产成人黄色av| 91精品国产高清久久久久久91裸体| 国产在线精品一区二区三区| 欧美午夜性视频| 日本一区二区三区四区五区六区| 欧美精品激情视频| 精品毛片久久久久久| 国产精品久久久久久亚洲影视 | 日本久久久久亚洲中字幕| 伊人久久婷婷色综合98网| 不用播放器成人网| 国产精品久久久久久久一区探花| 日韩中文字幕网| 深夜福利91大全| 久久99精品久久久久久水蜜桃| 久久久欧美精品| 久久久亚洲国产天美传媒修理工| 91美女片黄在线观看游戏| 99国内精品久久久久久久软件| 成人h在线播放| 国产精品中文字幕在线观看| 国产一区二区网| 麻豆一区区三区四区产品精品蜜桃| 欧美精品一区二区视频| 青青草国产精品一区二区| 日本不卡免费新一二三区| 日韩欧美精品在线不卡| 日本高清视频精品| 日韩免费观看网站| 人人妻人人添人人爽欧美一区 | 霍思燕三级露全乳照| 青青青青在线视频| 日韩精品一区二区三区色偷偷| 日本成人黄色| 日韩精品久久一区二区| 日韩精品―中文字幕| 欧美视频免费看欧美视频| 欧美日韩国产精品一卡| 国内精品免费午夜毛片| 国产深夜精品福利| 成人久久久久久| 91国视频在线| 国产传媒一区| 俺也去精品视频在线观看| 久久精品一本久久99精品| 超碰日本道色综合久久综合| 欧美激情xxxx| 性亚洲最疯狂xxxx高清| 秋霞成人午夜鲁丝一区二区三区| 欧美精品与人动性物交免费看| 国产综合中文字幕| 99在线免费视频观看| 国产成人精品免费视频| 精品国产一区二区三区在线观看| 国产精品美女久久久免费| 欧美wwwxxxx| 午夜啪啪免费视频| 日韩免费毛片| 国产综合av一区二区三区| 97精品视频在线| 日韩一区二区精品视频| 国产精品久久久久9999爆乳| 中国丰满熟妇xxxx性| 日韩av在线播放不卡| 国内自拍在线观看| 成人精品一区二区三区| 九色91视频| 欧美精品一区二区免费| 欧美一区二区视频17c| 欧美一区激情视频在线观看| 国产色婷婷国产综合在线理论片a| www.国产二区| 日韩亚洲精品电影| 九色91av视频| 日韩视频免费播放| 国产伦精品一区二区三区四区视频_ | 国产一区二区片| 91精品视频免费| 国产成人欧美在线观看| 中文字幕不卡每日更新1区2区| 熟女视频一区二区三区| 麻豆av一区二区| 91精品国产综合久久香蕉最新版| 精品国产网站地址| 亚洲综合激情五月| 欧美高清视频一区| 91精品国产色综合久久不卡98| 国产精品美女免费| 日本一欧美一欧美一亚洲视频| 国产毛片久久久久久国产毛片| 色婷婷综合久久久久| 亚洲最大成人网色| 欧美激情一区二区三区在线视频| av在线免费观看国产| 国产精品少妇在线视频| 五月天综合婷婷| 精品一区2区三区| 久久久精品动漫| 国产99视频在线观看| 青青精品视频播放| 68精品久久久久久欧美| 久久国产色av| 欧美日韩在线不卡一区| 久久久神马电影| 欧美激情欧美激情在线五月| 欧美国产一二三区| 久久精品一区二| 亚洲一区免费看| 国产日韩综合一区二区性色av| 日韩中文理论片| 日日摸日日碰夜夜爽av| 国产精品夜色7777狼人| 国产精品久久久久77777| 日本精品久久电影| 7777精品视频| 亚洲综合日韩中文字幕v在线| 精品一区久久| 国产成人免费av电影| 性一交一乱一伧国产女士spa| 国产精品一区二区三区精品| 国产精品久久久久久久久久久久| 日韩精品视频久久| 久久免费精品视频| 亚洲精品日韩精品| 99精品人妻少妇一区二区| 欧美日韩国产999| 国产日本一区二区三区| 国产精品欧美激情在线播放| 日日噜噜夜夜狠狠久久丁香五月| 99久久伊人精品影院| 在线观看欧美亚洲| 国产精品一久久香蕉国产线看观看 | 国产在线观看91精品一区| 国产精品视频精品| 欧美亚洲免费在线|