R語言線性回歸

線性回歸模型

線性回歸模型的計算

#lm()可以完成多元線性回歸函式的估計，回歸系統與回歸方程的檢驗的工作
#summary()函式，返回列表內容
#x1表示體重，x2表示年齡，y表示對應體重與年齡下的血壓
blood
data
.frame(
x1=c(76.0, 91.5, 85.5, 82.5, 79.0, 80.5, 74.5, 
79.0, 85.0, 76.5, 82.0, 95.0, 92.5),
x2=c(50, 20, 20, 30, 30, 50, 60, 50, 40, 55, 
40, 40, 20),
y= c(120, 141, 124, 126, 117, 125, 123, 125,
132, 123, 132, 155, 147)
)lm.sol
+x1+x2, data
=blood)#1表示常數項
summary(lm.sol)
call:
lm(formula = y ~ 1
+ x1 + x2, data
= blood)
residuals:
min1q  median      3q     max
-4.0404
-1.0183
0.4640
0.6908
4.3274 
coefficients:
estimate std. error t value pr(>|t|)    
(intercept) -
62.96336
16.99976
-3.704
0.004083
** x1            2.13656
0.17534
12.185
2.53e-07
***x2            0.40022
0.08321
4.810
0.000713
***---
signif. codes:  0 『***』 0.001 『**』 0.01 『*』 0.05 『.』 0.1 『 』 1
residual standard error: 2.854
on10 degrees of freedom
multiple r-squared:  0.9461,    adjusted r-squared:  0.9354 
f-statistic: 87.84on2
and10 df,  p-value: 4.531e-07

總結

1.call：為函式所使用的模型

2.residuals:殘差，列出了最小值，1/4，中位數，3/4，最大值。

3.coefficients:係數，estimate表示估計值（截距，x1,x2係數），3.1.std.error表示各個估計值的標準差

3.2.t value表示t統計值

3.3.pr(>|t|)表示對應統計量的p值

3.4.signif. codes:顯著性檢測,p值在那個區間就用對應的星號表

4.residal standard error表示殘差的標準差，自由度為n-p-1

5.multiple r-squared 表示相關係數的平方，r^2

6.adjusted r-squared 表示修正相關係數平方，這個值小於r^2，其目的是不要輕易做出自變數與因變數相關的判斷。消除了r平方對自變數個數的依賴。

7.f-statistic 表示f統計量，自由度為（p,n-p-1）,p-value表示f統計量對應的p值

y = -62.96 +2.136
*x1 + 0.4002
*x2

**區間與置信區間

#在r中使用predict()函式來計算y0的估計值，**區間和e（y0）的置信區
間。#置信區間估計(confidence interval estimate)：利用估計的回歸方程，對於自變數 x 的乙個給定值 x0 ，求出因變數 y 的平均值的估計區間。
#**區間估計(prediction interval estimate)：利用估計的回歸方程，對於自變數 x 的乙個給定值 x0 ，求出因變數 y 的乙個個別值的估計區間。
## 一元回歸估計
x為含碳量，y為合金強度
x0.17, 0.18, 0.20, 0.21, 0.23)
y53.0, 50.0, 55.0, 55.0, 60.0)
lm.sol
summary(lm.sol)
## 計算**值, 並繪圖
newpp
#3行15列（fit      lwr      upr）
pc#3行15列（fit      lwr      upr）
par(mai=c(0.8, 0.8, 0.2, 0.2))
#這句程式？
matplot(new$x, cbind(pp, pc[,-1]), type="l",
xlab="x", ylab="y", lty=c(1,5,5,2,2), 
col=c("blue", "red", "red", "brown", "brown"), 
lwd=2)
#new$x為方向上的15個值，pp與pc的fit值相同，故去除相同一列，cbine表示y方向上的值
points(x,y, cex=1.4, pch=21, col="red", bg="orange")
#增加出x,y的點圖
legend(0.1, 63, 
c("points", "fitted", "prediction", "confidence"), 
pch=c(19, na, na, na), lty=c(na, 1,5,2), 
col=c("orange", "blue", "red", "brow")
)#增加標籤
				R語言多元線性回歸
1 根據業務經驗挑選出可能對 目標變數有影響的指標 2 將這些指針對目標變數做相關性分析cor 挑選出相關係數較大的指標進一步分析 3 檢驗這些指標與目標變數是否線性關係，一般可以plot 檢驗，如果非線性，嘗試做平方或開方等方法使之變成線性 3 將相關係數較大的指標全部作為解釋變數做多遠線性回歸l...
				R語言多元線性回歸
toothpaste data.frame x1 c 0.05,0.25,0.60,0,0.25,0.20,0.15,0.05,0.15,0.15,0.20,0.10,0.40,0.45,0.35,0.30,0.50,0.50,0.40,0.05,0.05,0.10,0.20,0.10,0.50,0...
				R語言線性模型glm  logistic回歸模型
r語言廣義線性模型glm 函式 glm formula,family family.generator,data,control list formula資料關係，如y x1 x2 x3 family 每一種響應分布 指數分布族 允許各種關聯函式將均值和線性 器關聯起來。常用的family bino...

R語言線性回歸

R語言多元線性回歸

R語言多元線性回歸

R語言線性模型glm logistic回歸模型

相關推薦