$1RYHO(UDVXUHEDVHG'DWD7UDQVPLVVLRQIRU
&ORXG6WRUDJH
5DPSUDNDVK.RWD
Senior System Architect, USA, Research Scholar, Department of CSE, ANU, India.
'U5DMDVHNKDUD5DR.XUUD
Director, Sri Prakash College of Engineering (SPCE), Tuni, India.
$EVWUDFW² :LWK WKH FRQWLQXRXV JURZWK LQ WKH FORXG EDVHG DSSOLFDWLRQV DQG VWRUDJH UHTXLUHPHQWV RQ FORXG PDQ\ FRPPHUFLDO FORXG EDVHG VWRUDJH VHUYLFH SURYLGHUV DUH PDNLQJ WKH PDUNHW PRUH FRPSHWLWLYH ZLWK WKH DGYDQFHG WHFKQRORJLHV DQG UHGXFHG FRVW 7KH DGYDQFHPHQW LQ WHFKQRORJLHV DQG KLJK GHPDQG IRU IDXOW WROHUDQW VWRUDJH VROXWLRQV PRVWRIWKHFORXGEDVHGFRPPHUFLDOVWRUDJHVHUYLFHSURYLGHUVDUHQRZHTXLSSHGZLWK(UDVXUHEDVHG5HHG±6RORPRQIDXOW WROHUDQFHPHFKDQLVP+RZHYHUWKHDGGLWLRQDOFRVWIRUUHSOLFDWLRQLVVWLOODQRYHUKHDGIRUVHUYLFHSURYLGHUVDQGFXVWRPHUV :HSURSRVHDWKUHVKROGSUR[\UHHQFU\SWLRQVFKHPHDQGLQWHJUDWHLWZLWKDGHFHQWUDOL]HGHUDVXUHFRGHVXFKWKDWDVHFXUH GLVWULEXWHGVWRUDJHV\VWHPLVIRUPXODWHG7KHGLVWULEXWHGVWRUDJHV\VWHPQRWRQO\VXSSRUWVVHFXUHDQGUREXVWGDWDVWRUDJH DQGUHWULHYDOEXWDOVROHWVDXVHUIRUZDUGKLVGDWDLQWKHVWRUDJHVHUYHUVWRDQRWKHUXVHUZLWKRXWUHWULHYLQJWKHGDWDEDFN 7KH PDLQ WHFKQLFDO FRQWULEXWLRQ LV WKDW WKH SUR[\ UHHQFU\SWLRQ VFKHPH VXSSRUWV HQFRGLQJ RSHUDWLRQV RYHU HQFU\SWHG PHVVDJHVDVZHOODVIRUZDUGLQJRSHUDWLRQVRYHUHQFRGHGDQGHQFU\SWHGPHVVDJHV2XUPHWKRGIXOO\LQWHJUDWHVHQFU\SWLQJ HQFRGLQJDQGIRUZDUGLQJ:HDQDO\VHDQGVXJJHVWVXLWDEOHSDUDPHWHUVIRUWKHQXPEHURIFRSLHVRIDPHVVDJHGLVSDWFKHG WR VWRUDJH VHUYHUV DQG WKH QXPEHU RI VWRUDJH VHUYHUV TXHULHG E\ D NH\ VHUYHU 7KHVH SDUDPHWHUV DOORZ PRUH IOH[LEOH DGMXVWPHQWEHWZHHQWKHQXPEHURIVWRUDJHVHUYHUVDQGUREXVWQHVV
.H\ZRUGV-(UDVXUH5HHG±6RORPRQ&RGH6KRUWHQLQJ3HUIRUPDQFH&RPSDULVRQWKUHVKROGFU\SWRJUDSK\VHFXUHVWRUDJH V\VWHP
, ,1752'8&7,21
,Q WKH SDVW \HDUV WKH KLJK XSFRPLQJ GHPDQG IRU VWRUDJH ZLWK KLJK SHUIRUPDQFH DQG UHOLDELOLW\ ZHUH EHHQ XQGHUVWRRG7KHLQGXVWU\ZDVDSSURDFKLQJWRZDUGVDSKDVHZKHUHWKHODFNRIVWDQGDUGL]DWLRQRIGLJLWDOVWRUDJHZDV OLPLWLQJWKHDSSOLFDWLRQVWRPDNHVWRUDJHPRUHUHOLDEOHIRUFRPPHUFLDOVWRUDJHSURYLGHUV7KHPDMRUERWWOHQHFNIRU WKHVWDQGDUGL]DWLRQZDVWKHQRQVWDQGDUGVWRUDJHVROXWLRQVXVHGE\GLIIHUHQWVHUYLFHSURYLGHUV,QWKHHDUO\¶VWKH LQGXVWU\ DGRSWHG FORXG FRPSXWLQJIRUGLVWULEXWHG VWRUDJH VROXWLRQV 7KH HIIRUWZDVZHOO UHFRJQL]HG DQG PXOWLSOH FRPSDQLHVFDPHWRJHWKHUWRIRUPDFRQVRUWLXPLQRUGHUWRIUDPHWKHVWDQGDUGL]DWLRQIRUGLJLWDOVWRUDJH
$V IDU DV GDWD VWRUDJH LV FRQFHUQHG WKHUH DUH PXOWLSOH VFKHPHV DUH DYDLODEOH WR LPSURYH ILOH DQG GDWD FRPSUHVVLRQ7KHRWKHUPRVWLQIOXHQFLQJSDUDPHWHUV)RULQVWDQFHDGDWDILOHWKDWLVXSORDGHGDQGDFFHVVHGRQWKH VHUYHUPD\VHULRXVO\EHHIIHFWHGE\WKHQHWZRUNEDQGZLGWKDVZHOODVWKHVHUYHUZRUNORDG7KLVZLOOGHJUDGHWKH HIILFLHQF\>@0RUHRYHUWKHFORXGVWRUDJHVHUYLFHVGHDOVZLWKDJUHDWVFRSHDQGGRPDLQRIWKHGDWDEHLQJVWRUDJH DQGUHWULHYHGDORQJZLWKWKHIUHTXHQF\RIDFFHVVYDU\LQJGHSHQGLQJRQWKHPRGHRIWKHRSHUDWLRQSHUIRUPHGRQWKH GDWD>@2IIHULQJXQOLPLWHGVWRUDJHFRQWDLQHUVSDFHPLJKWFDXVHDKLJKHFRQRPLFGUDZEDFNRQWKHFORXGVWRUDJH SURYLGHUDQGDVZHOODVWKHXVHUVGXHWRLQHIILFLHQWVWRUDJH>@+HQFHDWHFKQLTXHRUDXWRPDWLRQLVQHHGHGWRILQG WKHEHVWVXLWDEOHVWRUDJHVWUXFWXUHEDVHGRQFRVWDQGRWKHULQIOXHQFLQJIDFWRUV7KHUHDUHPDQ\IUHHRIIHULQJVRIWKH FORXGVWRUDJHVHUYLFHVKRZHYHUWKH\PD\QRWVXLWHWKHDSSOLFDWLRQUHTXLUHPHQWWRWKHEHVWDOZD\V>@
7ZRPDMRUFRPSDQLHV3KLOLSVDQG6RQ\WRRNWKHPDMRULQLWLDWLYHWRGHILQHWKHVWDQGDUGVWRUDJHIRUPDWVLQGLJLWDO PHGLD7KHVWDQGDUGLVZHOODFFHSWHGWRGD\DQGEHHQUHIHUUHGDVFRPSDFWVWRUDJHIRUPDW7KLVVWDQGDUGIRUPDWLV PDMRUO\XVHGIRUDFKLHYLQJDQ\GDWDZKLFKDOVRUHGXFHVWKHVWRUDJHFRVWFRPSDUHGWRWKHHDUO\VWRUDJHIRUPDWV +RZHYHUWKHFRPSDFWVWRUDJHIRUPDWKDVOLPLWDWLRQVLQRUGHUWRDFKLHYHKLJKDYDLODELOLW\,WLVGLIILFXOWWRSUHGLFW KRZDVWRUDJHPHGLDJHWVFRUUXSWHG,QWKHHDUOLHUVWXGLHVZHKDYHXQGHUVWRRGWKHUHDVRQVIRUVWRUDJHGHYLFHIDLOXUH +HQFHIRUWKZHUHDOLVHWKHIROORZLQJHUURUVIRUVWRUDJHIDLOXUHVDV
7KHDGGLWLRQDOQRLVHDIIHFWLQJWKHVWRUDJHGXULQJWUDQVPLVVLRQRUGXULQJUHWULHYDODQG 0LVKDQGOLQJRIWKHUHPRYDEOHGHYLFHV
DYDLODEOH VWRUDJH VHUYHUV E\ WKH GHFRGLQJ SURFHVV 7KLV SURYLGHV D WUDGH RII EHWZHHQ WKH VWRUDJH VL]H DQG WKH WROHUDQFHWKUHVKROGRIIDLOXUHVHUYHUV$GHFHQWUDOL]HGHUDVXUHFRGHLVDQHUDVXUHFRGHWKDWLQGHSHQGHQWO\FRPSXWHV HDFKFRGHZRUGV\PEROIRUDPHVVDJH7KXVWKHHQFRGLQJSURFHVVIRUDPHVVDJHFDQEHVSOLWLQWRQSDUDOOHOWDVNVRI JHQHUDWLQJFRGHZRUGV\PEROV$GHFHQWUDOL]HGHUDVXUHFRGHLVVXLWDEOHIRUXVHLQDGLVWULEXWHGVWRUDJHV\VWHP$IWHU WKHPHVVDJHV\PEROVDUHVHQWWRVWRUDJHVHUYHUVHDFKVWRUDJHVHUYHULQGHSHQGHQWO\FRPSXWHVDFRGHZRUGV\PERO IRUWKHUHFHLYHGPHVVDJHV\PEROVDQGVWRUHVLW7KLVILQLVKHVWKHHQFRGLQJDQGVWRULQJSURFHVV7KHUHFRYHU\SURFHVV LVWKHVDPH
6WRULQJGDWDLQDWKLUGSDUW\¶VFORXGV\VWHPFDXVHVVHULRXVFRQFHUQRQGDWDFRQILGHQWLDOLW\,QRUGHUWRSURYLGH VWURQJ FRQILGHQWLDOLW\ IRU PHVVDJHV LQ VWRUDJH VHUYHUV D XVHU FDQ HQFU\SW PHVVDJHV E\ D FU\SWRJUDSKLF PHWKRG EHIRUHDSSO\LQJDQHUDVXUHFRGHPHWKRGWRHQFRGHDQGVWRUHPHVVDJHV:KHQKHZDQWVWRXVHDPHVVDJHKHQHHGVWR UHWULHYHWKHFRGHZRUGV\PEROVIURPVWRUDJHVHUYHUVGHFRGHWKHPDQGWKHQGHFU\SWWKHPE\XVLQJFU\SWRJUDSKLF NH\V7KHUHDUHWKUHHSUREOHPVLQWKHDERYHVWUDLJKWIRUZDUGLQWHJUDWLRQRIHQFU\SWLRQDQGHQFRGLQJ)LUVWWKHXVHU KDVWRGRPRVWFRPSXWDWLRQDQGWKHFRPPXQLFDWLRQWUDIILFEHWZHHQWKHXVHUDQGVWRUDJHVHUYHUVLVKLJK6HFRQGWKH XVHU KDV WR PDQDJH KLV FU\SWRJUDSKLF NH\V ,I WKH XVHU¶V GHYLFH RI VWRULQJ WKH NH\V LV ORVW RU FRPSURPLVHG WKH VHFXULW\LVEURNHQ)LQDOO\EHVLGHVGDWDVWRULQJDQGUHWULHYLQJLWLVKDUGIRUVWRUDJHVHUYHUVWRGLUHFWO\VXSSRUWRWKHU IXQFWLRQV )RUH[DPSOH VWRUDJH VHUYHUV FDQQRWGLUHFWO\ IRUZDUG D XVHU¶V PHVVDJHV WRDQRWKHURQH 7KH RZQHURI PHVVDJHVKDVWRUHWULHYHGHFRGHGHFU\SWDQGWKHQIRUZDUGWKHPWRDQRWKHUXVHU
7KHUHVWRIWKHZRUNLVIUDPHGVXFKDVLQ6HFWLRQ,,ZHXQGHUVWDQGWKHFRVWHIIHFWLYHQHVVRIWKHFRPPHUFLDOFORXG VWRUDJH VROXWLRQV LQ 6HFWLRQ ,,, ZH UHDOLVH WKH EDVLF 5HHG 6RORPRQ )DXOW 7ROHUDQFH VFKHPH LQ 6HFWLRQ ,9 ZH SURSRVHWKHQRYHO5HHG±6RORPRQEDVHGFRGHLQ6HFWLRQ9ZHSURSRVHWKHIXUWKHURSWLPL]DWLRQRIWKHSURSRVHG FRGHLQ6HFWLRQ9,ZHGLVFXVVWKHLPSOHPHQWDWLRQDQGUHVXOWVDQGLQ6HFWLRQ9,,ZHFRQFOXGHWKHZRUN
,,&200(5&,$/&/28'6725$*(6(59,&(6
$V WKH FKRLFH RI VWRUDJH VHUYLFHV IURP FORXG LV QRW OLPLWHG DQG PRVW RI WKRVH DUH FRQILJXUHG WR JLYH EHVW DGYDQWDJHVIRUVSHFLILFW\SHRIGDWDDQGRSHUDWLRQZHFRPSDUHPRVWRIWKHVHUYLFHVKHUH>±@.
A. Dropbox
7KH'URSER[LVDVWRUDJHVHUYLFHZKLFKLVDYDLODEOHIRUFOLHQWVLGHDFFHVVIRU:LQGRZVV\VWHPV/LQX[6\VWHPV 0DFLQWRVKV\VWHPV%ODFNEHUU\PRELOHRSHUDWLQJV\VWHPV$QGURLGPRELOHRSHUDWLRQV\VWHPVDQGILQDOO\WKH,3KRQH RSHUDWLQJV\VWHPV7KHIUHH%DVLFDFFRXQWFRPHVZLWKDSDOWU\*%RIVWRUDJH)RUGRFXPHQWEDVHGDSSOLFDWLRQVWKLV LVKXJH7KH6WRUDJHVHUYLFHLVJRRGFKRLFHIRUDSSOLFDWLRQVXVLQJWKHFRQWDLQHUIRUUHDGRQO\GDWD
7DEOH&RVW&RPSDULVRQIRU'URSER[ 'DWD/RDG &RVW /RDGLQ*LJD %\WHV 3ULFHLQ86'ROODUV
86'
86'
86'
86'
86'
1RW$YDLODEOH ! 1RW$YDLODEOH
)LJ&RVW&RPSDULVRQIRU'URSER[
7DEOH6XSSRUWIRU0RELOH%DVHG&ORXG$SSOLFDWLRQVLQ'URSER[ &OLHQW267\SH 6XSSRUW $SSOH,3KRQH2SHUDWLQJ6\VWHPV $YDLODEOH $QGURLG0RELOH2SHUDWLQJ6\VWHPV $YDLODEOH
%ODFNEHUU\2SHUDWLQJ6\VWHPV $YDLODEOH 0LFURVRIW0RELOH2SHUDWLQJ6\VWHP $YDLODEOH
B. Google Drive
7KHPRVWSRSXODUFORXGVWRUDJHVHUYLFHLV'ULYHVWRUDJHIURP*RRJOH7KHEDVLFDFFRXQWFRPHVZLWK*LJD E\WHVRIVWRUDJHIRUDQHZFXVWRPHUDFFRXQWRUDQH[LVWLQJDFFRXQWFUHDWHGZLWK*RRJOH(PDLO7KHKLJKHVWUDWHG EHQHILWRIWKH*RRJOH'ULYHLVWKHVHUYLFHFDQEHDOVREHLQWHJUDWHGZLWKRWKHUH[LVWLQJ*RRJOHVHUYLFHVIRUVWRULQJ YDULRXVW\SHVRIGDWDIURPRWKHUVHUYLFHV
7DEOH&RVW&RPSDULVRQIRU*RRJOH'ULYH 'DWD/RDG &RVW /RDGLQ*LJD%\WHV 3ULFHLQ86'ROODUV
86'
86'
86'
86'
86'
86'
! WR86'
+HUHZHSURYLGHDJUDSKLFDOUHSUHVHQWDWLRQRIWKHFRVWSULFHFRPSDULVRQ
)LJ&RVW&RPSDULVRQIRU*RRJOH'ULYH
ZKLFKFDQEHGLJLWDOO\VLJQHGIRUYHULILFDWLRQV7KHFRUHWHFKQRORJ\EHKLQGWKLVSURYLGHULVOLQNVKDULQJZKHUHWKH VHQGHUFDQXSORDGDILOHDQGWKHOLQNWRWKDWVDPHILOHFDQEHVKDUHGZLWKWKHUHFLSLHQW7KHUHFLSLHQWFDQFOLFNRQWKH OLQNWRGRZQORDGWKHVDPH7KLVVHUYLFHLVSRSXODUIRUEXVLQHVVXVHUVDVLWSURYLGHVWKHSULYDWHFORXGVWRUDJHDQGWKH GHVNWRSYHUVLRQRIWKHFOLHQWZKLFKFDQEHXVHGIRUV\QFLQJORFDOILOHVWRWKHFORXGVWRUDJH
7DEOH&RVW&RPSDULVRQIRU+LJKWDLO 'DWD/RDG &RVW /RDGLQ*LJD%\WHV 3ULFHLQ86'ROODUV
)UHH
)UHH
)UHH
)UHH
)UHH
)UHH
! 86'
7DEOH6XSSRUWIRU0RELOH%DVHG&ORXG$SSOLFDWLRQVLQ+LJKWDLO &OLHQW267\SH 6XSSRUW $SSOH,3KRQH2SHUDWLQJ6\VWHPV $YDLODEOH $QGURLG0RELOH2SHUDWLQJ6\VWHPV 1RW$YDLODEOH
%ODFNEHUU\2SHUDWLQJ6\VWHPV 1RW$YDLODEOH 0LFURVRIW0RELOH2SHUDWLQJ6\VWHP 1RW$YDLODEOH
D. OneDrive
7KH2QH'ULYHZDVSUHYLRXVO\SRSXODUDV6N\'ULYH7KHIXQFWLRQDOLWLHVDUHPRVWO\VDPHDV'URSER[7KHPRVW LPSRUWDQWIDFWRUIRUWKLVVWRUDJHVHUYLFHLVWKDWWKHFOLHQWYHUVLRQLVDYDLODEOHIRU:LQGRZVV\VWHPV/LQX[6\VWHPV 0DFLQWRVKV\VWHPV%ODFNEHUU\PRELOHRSHUDWLQJV\VWHPV$QGURLGPRELOHRSHUDWLRQV\VWHPVDQGILQDOO\WKH,3KRQH RSHUDWLQJV\VWHPV0RUHRYHUWKHVXSSRUWVIRUVRFLDOPHGLDSOXJLQVDUHDOVRDYDLODEOHKHUH7KLVIHDWXUHPDNHVWKH DSSOLFDWLRQPRUHFRPSDWLEOHZLWKRWKHUDSSOLFDWLRQVWRDFFHVVGDWDGLUHFWO\
7DEOH&RVW&RPSDULVRQIRU2QH'ULYH 'DWD/RDG &RVW /RDGLQ*LJD%\WHV 3ULFHLQ86'ROODUV
86'
86'
1RW$YDLODEOH 1RW$YDLODEOH 1RW$YDLODEOH 1RW$YDLODEOH ! 1RW$YDLODEOH
)LJ&RVW&RPSDULVRQ2QH'ULYH
7DEOH6XSSRUWIRU0RELOH%DVHG&ORXG$SSOLFDWLRQVLQ2QH'ULYH &OLHQW267\SH 6XSSRUW $SSOH,3KRQH2SHUDWLQJ6\VWHPV $YDLODEOH $QGURLG0RELOH2SHUDWLQJ6\VWHPV $YDLODEOH %ODFNEHUU\2SHUDWLQJ6\VWHPV $YDLODEOH 0LFURVRIW0RELOH2SHUDWLQJ6\VWHP $YDLODEOH
E. SugarSync
7KH6XJDU6\QFLVPDMRUO\SRSXODUDPRQJEXVLQHVVXVHUVIRULWVHIIHFWLYHDQGIDVWRQOLQHEDFNXSVROXWLRQV7KH VHUYLFH FDQ DOVR EH XVHG IRU FRPSOHWH IROGHU DQG LQGLYLGXDO ILOH V\QFLQJ ZLWK PXOWLSOH DSSOLFDWLRQV DQG PXOWLSOH XVHUV0RUHRYHUWKHVHUYLFHSURYLGHVDXQLTXHIXQFWLRQWRVKDUHWKHVWRUHGFRQWHQWRYHUPXOWLSOHGHYLFHVDWVDPH SRLQWRIWLPHEXWZLWKGLIIHUHQWSHUPLVVLRQOHYHOV7KHPRVWLPSRUWDQWIDFWRUIRUWKLVVWRUDJHVHUYLFHLVWKDWWKHFOLHQW YHUVLRQLVDYDLODEOHIRU$QGURLGPRELOHRSHUDWLRQV\VWHPVDQGDOVRWKH,3KRQHRSHUDWLQJV\VWHPV
7DEOH&RVW&RPSDULVRQIRU6XJHU6\QF
'DWD/RDG &RVW
/RDGLQ*LJD%\WHV 3ULFHLQ86'ROODUV
86'
86'
86'
86'
86'
86'
! 3D\3HU8VH
+HUHZHSURYLGHDJUDSKLFDOUHSUHVHQWDWLRQRIWKHFRVWSULFHFRPSDULVRQ
)LJ&RVW&RPSDULVRQIRU6XJDU6\QF
:KHUH&LVWKHFROOHFWLRQRI&KHFNVXPGHYLFHV
7KHFKHFNVXPGHYLFHVZLOOKROGWKHFDOFXODWHGYDOXHVIURPHDFKUHVSHFWLYHGDWDVWRUDJHGHYLFHV 7KHJRDOLVWRUHVWRUHWKHYDOXHVLIDQ\GHYLFHIURPWKH&FROOHFWLRQIDLOVXVLQJWKHQRQ±IDLOHGGHYLFHV
7KH5HHG±6RORPRQGHSOR\VDIXQFWLRQ*LQRUGHUWRFDOFXODWHWKHFKHFNVXPFRQWHQWIRUHYHU\GHYLFHLQ&+HUHIRU WKLVVWXG\ZHXQGHUVWDQGWKHH[DPSOHRIWKHFDOFXODWLRQZLWKWKHYDOXHVDV. DQG/ IRUWKHGHYLFHV&DQG&
ZLWK*DQG*UHVSHFWLYHO\>@
7KHFRUHIXQFWLRQDOLWLHVRI5HHG±6RORPRQLVWREUHDNWKHFROOHFWLRQRIVWRUDJHGHYLFHVLQQXPEHURIZRUGV>@ >@+HUHLQWKLVH[DPSOHZHXQGHUVWDQGWKHHDFKQXPEHURIZRUGVLVRIXELWVUDQGRPO\+HQFHWKHZRUGVLQHDFK GHYLFHFDQEHDVVXPHGDVYZKHUHYLVGHILQHGDV
bits word v nbytes
byte u Bits
§ ·
§ ·
¨ ¸
¨ ¸
© ¹ © ¹ «(T
)XUWKHUPRUHYLVGHILQHGDV n V
u «(T
+HQFHIRUWKZHXQGHUVWDQGWKHIRUPXODWLRQIRUFKHFNVXPIRUHDFKVWRUDJHGHYLFHDV
i i k
C W D D D D «(T
:KHUHWKHFRGLQJIXQFWLRQ:LVGHILQHGWRRSHUDWHRQHDFKZRUG
$IWHUWKHGHWDLOXQGHUVWDQGLQJRIWKH(UDVXUHIDXOWWROHUDQFHVFKHPHZHKDYHLGHQWLILHGWKHOLPLWDWLRQVRIWKH DSSOLFDELOLW\WRWKHFORXGVWRUDJHVHUYLFHVDQGSURSRVHWKHQRYHOVFKHPHIRUIDXOWWROHUDQFHLQWKLVZRUNLQWKHQH[W
VHFWLRQ
,9352326('129(/)$8/772/(5$1&(6&+(0(
7KH VWDQGDUG IDXOW WROHUDQFH PHFKDQLVP GHSHQGV RQ WKH HUDVXUH FRGHV >@ 7KH EDVLF PHFKDQLVP FDQ EH XQGHUVWRRGLIZHDVVXPHDFROOHFWLRQRIQGLVNVDUHSDUWLWLRQHGLQWRNGLVNV+HQFHWKHUHZLOOEHPGLVNVZKLFKZLOO KROGWKHFRGLQJLQIRUPDWLRQDV r n i i
m
n
k
¦
«(T:KHUHUGHQRWHVQXPEHURINPXOWLSOHRIGLVNV
7KHEDVLFLQWHUSUHWDWLRQRIWKHHUDVXUHFRGHVFDQEHXQGHUVWRRGDVHDFKGLVNPXVWKROGD]ELWZRUGWRUHSUHVHQWWKH FXVWRPHUGDWD,IZHGHQRWHWKHPZLWKGWKHQWKHWRWDOVHWRIFRGHVIRUNQXPEHURIGLVNVDUHFRQVLGHUHGDV
k
z z z
z
«(T$OVRZHFRQVLGHUWKHFRGHVVWRUHGRQHDFKHYHU\PGLVNZLWKFDQGWKHQWKHWRWDOUHSUHVHQWDWLRQLVFRQVLGHUHGDV
k
c c c
c
«(T7KHFRGLQJDQGWKHFXVWRPHUGDWDVKRXOGDOLQHDUFRPELQDWLRQDQGFDQEHUHSUHVHQWHGDV k k k k
m m m k k
c a z a z c a z a z
c a z a z
«(T 7KHFRHIILFLHQWV³D´DUHDOVR]ELWZRUGV(QFRGLQJWKHUHIRUH
A. RAID-4 and RAID-5
7KH5$,'±DQG5$,'±>@DUHWKHVLPSOHVWIRUPRIWKHHUDVXUHFRGHVH[SODLQHGLQWKLVZRUNHDUOLHU5$,' ±DQG5$,'±GLIIHUVIURPWKHEDVLFIUDPHZRUNDVLWHPSOR\VGLIIHUHQWDUUDQJHPHQWVRIGDWDUHSOLFDWLRQ7KH IUDPHZRUNIRU5$,'±DQG5$,'±DUHH[SODLQHGKHUH
7KH5$,'LVDPRGLILFDWLRQWR0'6FRGHZKHUHP DQG] 7KHEDVLFFRGLQJGHSHQGVRQDELWQRWHGDVSZKHUH
k
p
z
z
z
«(T,QFDVHRIDQ\ELWFKDQJLQJWKH;25FRGHZLOOLGHQWLI\LWIRUWKHVXUYLYLQJFRGH
B. Linux RAID-6
7KH/LQX[V\VWHP5$,'±>@>@LVFRQVLGHUHGDVDGGLWLRQDOVXSSRUWWR5$,'±DQG5$,'±DVLWXVHVDQ DOWHUQDWLYHGLVNXQGHUWKHIUDPHZRUN7KLVIUDPHZRUNSURSRVHVDQDOWHUQDWLRQWRWKH0'6DVFRQVLGHULQJWKHFRGHWR EHVWRUHGLQWZRGLVNVDVP +HQFHWKHIRUPXODWLRQLVWRRVLPSOHE\XVLQJDQ;25FRGH
k k
k
p z z z
q z z z
«(T
+HUHWKHFRGHVFDOOHGSDQGTZLOOEHVWRUHGRQDOWHUQDWLYHGLVNVWRHQVXUHWKH(UDVXUHFRGHWRSURWHFWWKHGDWDORVV
C. Array Codes
7KHIUDPHZRUNLVFDOOHG$UUD\FRGHDVLWLVLPSOHPHQWHGXVLQJU;QDUUD\RIFXVWRPHUGDWD,QWKLVIUDPHZRUNWKH FXVWRPHUGDWDZLOOEHVWRUHGZLWKWKHDUUDQJHPHQWVDV)LJXUH
)LJ$UUD\&RGH6WRUDJH
7KHDUUD\FRGHZLWKWKHIROORZLQJSDUDPHWHUVN P 5$,'Q NP U ]
D. Non-MDS Codes
7KH 1RQ0'6 FRGHV GR QRW DOORZ UHSOLFDWLRQ RI P VWRUDJH GHYLFHV WR DFKLHYH RSWLPDO IDXOW WROHUDQFH 7KH UHSOLFDWLRQ RI VWRUDJH GHYLFHV FRQWDLQLQJ WKH FRGH LV KLJKHU WKDQ WKH RWKHU IUDPHZRUNV +RZHYHU WKH HIILFLHQF\ SURYLGHG E\ WKH 1RQ0'6 FRGHV FRPSDUHG WR RWKHU IUDPHZRUNV LQ WHUPV RI SHUIRUPDQFH LV KLJK +HQFH ZH FRPSDUHDOOWKHW\SHVRIFRGHIUDPHZRUNVKHUH
:LWK WKH XQGHUVWDQGLQJ RI WKH OLPLWDWLRQV RI H[LVWLQJ HUDVXUH FRGHV WR EH DSSOLHG RQ WKH FORXG EDVHG VWRUDJH V\VWHPV DV WKH FRPSOH[ FDOFXODWLRQV ZLWK HUDVXUH FRGHV ZLOO UHGXFH WKH SHUIRUPDQFH RI DYDLODELOLW\ PHDVXUHV VLJQLILFDQWO\7KXVZHPDNHDQDWWHPSWWRUHGXFHWKHFDOFXODWLRQFRPSOH[LWLHVZLWKVLPSOHPDWKHPDWLFDORSHUDWLRQV LQWKHVWDQGDUGHUDVXUHVFKHPH
7KH FKHFNVXP IRU VWRUDJH GHYLFHV DUH FRQVLGHUHG DV &L IURP WKH (T :H SURSRVH WKH HQKDQFHPHQW DV WKH
IROORZLQJIRUPXODWLRQIRUFKHFNVXPFDOFXODWLRQ
i i k i k
C W D D D D W D D D D «(T
9237,0,=,1*352326('129(/)$8/772/(5$1&(6&+(0(
7KH 5HHG 6RORPRQ FRGH LV H[SUHVVHG E\ WKH SRZHU RI FRHIILFLHQW GHQRWHG E\ Q IRU WKH GDWD EORFNV, where n is expressed as
n = m …(T
and the code blocks are represented as m
k t …(T
Where m represents the number of bits per data and t represents the capability of correcting errors. In general the Reed – Solomon code considers an 8 bit data and 2 bit code, the error correcting code can be represented as (255,251) code.
Here in this part of the work, we try to optimize the code length further to reduce the replication cost. The steps of the optimization algorithm are explained here:
6WHS First we consider the effective code in (255,251) block, where the code is consisting of zero and non-zero codes.
6WHS Then we find the number of zero codes in the segment. For instance the numbers of zero codes are 227 in the code block. These codes will not have any effect in the error correction and fault tolerance mechanism.
6WHS Then we find the effective block of the code as (28,24) for a 2 bit error correction code. 6WHS Hence as a final outcome of the optimization technique, we got the optimized code block.
9,,03/(0(17$7,21$1'5(68/76
7RVLPXODWHDQGXQGHUVWDQGWKHLPSURYHPHQWLQWKHRXWFRPHVZHLPSOHPHQWWKH5HHG±6RORPRQFRGHZLWKWKH HQKDQFHPHQWDQGRSWLPL]DWLRQSURSRVHGLQWKLVZRUN.:HDFFHSWDQ\UDQGRPGDWDDVWKHLQLWLDOGDWDEORFNIRUWKH WHVWLQJ>7DEOH@
7DEOH,QLWLDO'DWD%ORFN
Based on the modified fault tolerance scheme, we realise the addition and multiplication table >7DEOH @
7DEOH$GGLWLRQ7DEOH DADADADADADADADADADADADADADADA _DADADADADADADADADADADADADADADA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADADA
7DEOH08/7,3/,&$7,217$%/(
DADADADADADADADADADADADADADADA _ DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA DA_DADADADADADADADADADADADADADA DA
,QWKLVZRUNWKHFRPPHUFLDOFORXGVWRUDJHVHUYLFHVDUHEHHQFRPSDUHGEDVHGRQWKHFRVWDQGSHUIRUPDQFHIDFWRUV 7KH UHVXOW RI WKH FRPSDUDWLYH PHDVXUHV SURYLGHG WKH XQGHUVWDQGLQJ RI WKH GHPDQG IRU KLJKO\ UHOLDEOH DQG FRVW HIIHFWLYH IDXOW WROHUDQFH V\VWHP +HQFHIRUWK LQ WKLV ZRUN ZH VWXG\ WKH FRUH 5HHG 6RORPRQ IDXOW WROHUDQFH PHFKDQLVP EDVHG RQ (UDVXUH FRGHV 7KH ZRUN FRQWULEXWHV WRZDUGV WKH LPSURYHG SHUIRUPDQFH FRGH IRU IDXOW WROHUDQFHIRUGLJLWDOVWRUDJHGHYLFHVUDWKHUWKDQPDJQHWLF$OVRWKHZRUNHQKDQFHGWKHSHUIRUPDQFHRIWKHSURSRVHG WHFKQLTXHE\DSSO\LQJWKHLPSURYHPHQWLQWHUPVRIRSWLPL]DWLRQ7KHUHVXOWRIWKHSURSRVHGRSWLPL]DWLRQWHFKQLTXH LVUHGXFWLRQLQWKHVWRUDJHFRVWZLWKRXWQHJRWLDWLQJZLWKWKHIDXOWWROHUDQFHUHOLDELOLW\.
5()(5(1&(6
>@ :=KDQJ+:DQJDQG%3DQ5HGXFHGFRPSOH[LW\/&&5HHG6RORPRQGHFRGHUEDVHGRQXQLILHGV\QGURPHFRPSXWDWLRQ,((( 7UDQV9HU\/DUJH6FDOH,QWHJU9/6,6\VWYROQRSS
>@ - %HOORUDGR DQG $ .DYFLF /RZFRPSOH[LW\ VRIWGHFRGLQJ DOJRULWKPV IRU 5HHG6RORPRQ FRGHV $Q DOJHEUDLF VRIWLQ KDUGRXW FKDVH GHFRGHU,(((7UDQV,QI7KHRU\YROQRSS
>@ -.XELDWRZLF]'%LQGHO<&KHQ3(DWRQ'*HHOV5*XPPDGL65KHD+:HDWKHUVSRRQ::HLPHU&:HOOVDQG%=KDR ³2FHDQVWRUH $Q $UFKLWHFWXUH IRU *OREDO6FDOH 3HUVLVWHQW 6WRUDJH´ 3URF 1LQWK ,QW¶O &RQI $UFKLWHFWXUDO 6XSSRUW IRU 3URJUDPPLQJ /DQJXDJHVDQG2SHUDWLQJ6\VWHPV$63/26SS
>@ 3 'UXVFKHO DQG $ 5RZVWURQ ³3$67 $ /DUJH6FDOH 3HUVLVWHQW 3HHUWR3HHU 6WRUDJH 8WLOLW\´ 3URF (LJKWK :RUNVKRS +RW 7RSLFV LQ 2SHUDWLQJ6\VWHP+RW269,,,SS
>@ $$G\D:-%RORVN\0&DVWUR*&HUPDN5&KDLNHQ-5'RXFHXU-+RZHOO-5/RUFK07KHLPHUDQG5:DWWHQKRIHU³)DUVLWH )HGHUDWHG$YDLODEOHDQG5HOLDEOH6WRUDJHIRUDQ,QFRPSOHWHO\7UXVWHG(QYLURQPHQW´3URF)LIWK6\PS2SHUDWLQJ6\VWHP'HVLJQDQG ,PSOHPHQWDWLRQ26',SS
>@ $+DHEHUOHQ$0LVORYHDQG3'UXVFKHO³*ODFLHU+LJKO\'XUDEOH'HFHQWUDOL]HG6WRUDJH'HVSLWH0DVVLYH&RUUHODWHG)DLOXUHV´3URF 6HFRQG6\PS1HWZRUNHG6\VWHPV'HVLJQDQG,PSOHPHQWDWLRQ16',SS
>@ 47DQJ³7\SH%DVHG3UR[\5H(QFU\SWLRQDQG,WV&RQVWUXFWLRQ´3URF1LQWK,QW¶O&RQI&U\SWRORJ\LQ,QGLD3URJUHVVLQ&U\SWRORJ\ ,1'2&5<37SS
>@ *$WHQLHVH.%HQVRQDQG6+RKHQEHUJHU³.H\3ULYDWH3UR[\5H(QFU\SWLRQ´3URF7RSLFVLQ&U\SWRORJ\&756$SS
>@ - 6KDR DQG = &DR ³&&$6HFXUH 3UR[\ 5H(QFU\SWLRQ ZLWKRXW 3DLULQJV´ 3URF WK ,QW¶O &RQI 3UDFWLFH DQG 7KHRU\ LQ 3XEOLF .H\ &U\SWRJUDSK\3.&SS
>@)*DUF+HUUHUR-9DOOVDQG3.0HKHU+LJKVSHHG56GHFRGHUEDVHGRQ/&&GHFRGLQJ&LUFXLWV6\VW6LJQDO3URFHVV YROQRSS
>@--LDQJDQG.51DUD\DQDQ$OJHEUDLFVRIWGHFLVLRQGHFRGLQJRI5HHG6RORPRQFRGHVXVLQJELWOHYHOVRIWLQIRUPDWLRQ,(((7UDQV ,QI7KHRU\YROQRSS