執行命令:
執行:clickhouse-client進入clickhouse客戶端for s in `seq 1987 2017`
dofor m in `seq 1 12`
domwget
done
done
建立表語句為:
sql語句需要壓縮後再執行。create table `ontime` (
`year` uint16,
`quarter` uint8,
`month` uint8,
`dayofmonth` uint8,
`dayofweek` uint8,
`flightdate` date,
`uniquecarrier` fixedstring(7),
`airlineid` int32,
`carrier` fixedstring(2),
`tailnum` string,
`flightnum` string,
`originairportid` int32,
`originairportseqid` int32,
`origincitymarketid` int32,
`origin` fixedstring(5),
`origincityname` string,
`originstate` fixedstring(2),
`originstatefips` string,
`originstatename` string,
`originwac` int32,
`destairportid` int32,
`destairportseqid` int32,
`destcitymarketid` int32,
`dest` fixedstring(5),
`destcityname` string,
`deststate` fixedstring(2),
`deststatefips` string,
`deststatename` string,
`destwac` int32,
`crsdeptime` int32,
`deptime` int32,
`depdelay` int32,
`depdelayminutes` int32,
`depdel15` int32,
`departuredelaygroups` string,
`deptimeblk` string,
`taxiout` int32,
`wheelsoff` int32,
`wheelson` int32,
`taxiin` int32,
`crsarrtime` int32,
`arrtime` int32,
`arrdelay` int32,
`arrdelayminutes` int32,
`arrdel15` int32,
`arrivaldelaygroups` int32,
`arrtimeblk` string,
`cancelled` uint8,
`cancellationcode` fixedstring(1),
`diverted` uint8,
`crselapsedtime` int32,
`actualelapsedtime` int32,
`airtime` int32,
`flights` int32,
`distance` int32,
`distancegroup` uint8,
`carrierdelay` int32,
`weatherdelay` int32,
`nasdelay` int32,
`securitydelay` int32,
`lateaircraftdelay` int32,
`firstdeptime` string,
`totaladdgtime` string,
`longestaddgtime` string,
`divairportlandings` string,
`divreacheddest` string,
`divactualelapsedtime` string,
`divarrdelay` string,
`divdistance` string,
`div1airport` string,
`div1airportid` int32,
`div1airportseqid` int32,
`div1wheelson` string,
`div1totalgtime` string,
`div1longestgtime` string,
`div1wheelsoff` string,
`div1tailnum` string,
`div2airport` string,
`div2airportid` int32,
`div2airportseqid` int32,
`div2wheelson` string,
`div2totalgtime` string,
`div2longestgtime` string,
`div2wheelsoff` string,
`div2tailnum` string,
`div3airport` string,
`div3airportid` int32,
`div3airportseqid` int32,
`div3wheelson` string,
`div3totalgtime` string,
`div3longestgtime` string,
`div3wheelsoff` string,
`div3tailnum` string,
`div4airport` string,
`div4airportid` int32,
`div4airportseqid` int32,
`div4wheelson` string,
`div4totalgtime` string,
`div4longestgtime` string,
`div4wheelsoff` string,
`div4tailnum` string,
`div5airport` string,
`div5airportid` int32,
`div5airportseqid` int32,
`div5wheelson` string,
`div5totalgtime` string,
`div5longestgtime` string,
`div5wheelsoff` string,
`div5tailnum` string
) engine = mergetree(flightdate, (year, flightdate), 8192)
向表中匯入資料:
資料匯入成功,然後查詢一下資料量。for i in *.zip; do echo $i; unzip -cq $i '*.csv' | sed 's/\.00//g' | clickhouse-client
--query="insert into ontime format csvwithnames"; done
一億七千萬資料,第一次執行2.479秒,第二次執行0.076秒。
這速度簡直逆天……
ClickHouse 備份與恢復
我們評估一款資料庫產品,除了穩定性和易用性外,資料安全也十分重要,備份與恢復往往是最後一道安全門。但如果備份策略不完善 恢復手段無效,一旦發生資料誤刪除就真的搶救無效 撲街了。目前clickhouse的備份方式有以下幾種 下面就逐個試試吧。資料備份 概述 測試資料 mysql中源資料6.70g,表資...
初識Clickhouse 安裝與部署(一)
二 安裝與部署 clickhouse是由 yandex在2016年發布的乙個分布式資料分析型資料庫,其效率要比veritca快約5倍,其不依賴於hadoop生態軟體和基礎及高可用的特性適用於許多資料分析場景 1.資料壓縮 在一些列式資料庫管理系統中 例如 infinidb ce 和 monetdb ...
node mysql部落格 node與mysql
首先我們需要先在pagejson.js的依賴中加入 mysql latest 如圖 然後npm install 這樣mysql就安裝好了。這裡為了後續專案運算元據庫方便,就建立了乙個連線資料庫的公用模板。我們可以新建乙個資料夾,我這裡是db資料夾,如下 db資料夾下有兩個js,具體 如下 congf...