按行分割大資料量檔案的bash指令碼實現

資料庫幾十萬的資料量備份下來後，如果直接在乙個檔案中匯入到資料庫中，無法實現。於是就想著怎麼按行把資料檔案分割成小塊檔案操作。

仔細看了下linux的head和tail指令，發現很符合按行分割的要求。接下來就是如何設計乙個shell指令碼，自動完成分割工作了。

指令碼如下（無論資料量多大，分割成50000條乙個檔案）：

divide.sh

#!/bin/bash echo "please input the large file:" read largefile echo "please input the output file(will add '.unl' automaticlly):" read outputfile #each file index,begin with 1 index=1 #divide size,each size 50000 size=50000 eachsize=50000 #get the largefile size filesize=`wc -l $largefile | awk ''` echo "the large file size is $filesize." #divide the large file to divided files, 50000 by 50000 while [ $size -lt $filesize ] do tempfile=`echo "$outputfile$index.unl"` echo "outputfile is made:$tempfile" index=`expr $index + 1` rm $tempfile touch $tempfile echo "head -$size $largefile | tail -$eachsize - > $tempfile" `head -$size $largefile | tail -$eachsize - > $tempfile` #next 50000 size=`expr $size + 50000` done #some oddment size=`expr $size - 50000` oddment=`expr $filesize - $size` if [ $oddment -gt 0 ]; then tempfile=`echo "$outputfile$index.unl"` echo "outputfile is made:$tempfile" rm $tempfile touch $tempfile echo "tail -$oddment $largefile > $tempfile" `tail -$oddment $largefile > $tempfile` fiecho "divide over!"

shell執行效果（以分割666666行記錄的mytable.unl資料庫匯出檔案為例）：

linux /home/user123/test> ./divide.sh please input the large file: mytable.unl please input the output file(will add '.unl' automaticlly): test the large file size is 666666. outputfile is made:test1.unl head -50000 mytable.unl | tail -50000 - > test1.unl outputfile is made:test2.unl head -100000 mytable.unl | tail -50000 - > test2.unl outputfile is made:test3.unl head -150000 mytable.unl | tail -50000 - > test3.unl outputfile is made:test4.unl head -200000 mytable.unl | tail -50000 - > test4.unl outputfile is made:test5.unl head -250000 mytable.unl | tail -50000 - > test5.unl outputfile is made:test6.unl head -300000 mytable.unl | tail -50000 - > test6.unl outputfile is made:test7.unl head -350000 mytable.unl | tail -50000 - > test7.unl outputfile is made:test8.unl head -400000 mytable.unl | tail -50000 - > test8.unl outputfile is made:test9.unl head -450000 mytable.unl | tail -50000 - > test9.unl outputfile is made:test10.unl head -500000 mytable.unl | tail -50000 - > test10.unl outputfile is made:test11.unl head -550000 mytable.unl | tail -50000 - > test11.unl outputfile is made:test12.unl head -600000 mytable.unl | tail -50000 - > test12.unl outputfile is made:test13.unl head -650000 mytable.unl | tail -50000 - > test13.unl outputfile is made:test14.unl tail -16666 mytable.unl > test14.unl

divide over!

最後不滿50000條的，單獨記錄到乙個新的檔案中。

按行分割大資料量檔案的bash指令碼實現

imp匯入大資料量檔案

大資料量的處理

大資料量mysql檔案匯入程式

按行分割大資料量檔案的bash指令碼實現

imp匯入大資料量檔案

大資料量的處理

大資料量mysql檔案匯入程式

相關推薦