1. Become a Developer for CrazyEngineers - Apply NOW!

Data deduplication Vs cloud data availability

Discussion in 'Computer Science | IT | Networking' started by kalaivani M, Aug 4, 2014.

  1. kalaivani M

    kalaivani M Apprentice

    Engineering Discipline:
    Computer Science
    can anyone clear my doubt.

    will the data deduplication affect cloud data availability??

    cloud service providers to manage their storage they are looking for deduplicating their similar content.

    if they do so, their storage may get freed but , will not this affect the availability of the data in cloud . that is only to have multiple copies we are prefering cloud but this is just reverse of our expectation.

    so is deduplication and cloud availability terms are trade offs??
     
    Last edited: Aug 4, 2014
  2. CrazyEngineeres T-Shirts Store
  3. Kaustubh Katdare

    Kaustubh Katdare Administrator

    Engineering Discipline:
    Electrical
    You need to explain your question a bit in more detail.
     
  4. Anoop Kumar

    Anoop Kumar Knight

    Engineering Discipline:
    IT
    Not sure about deduplicate you're refering to, but according to definition
    "data deduplication is a specialized data compression technique for eliminating duplicate copies of repeating data."

    Here I found the technique on this blog
    1. Divide the input data into blocks or “chunks.”
    2. Calculate a hash value for each block of data.
    3. Use these values to determine if another block of the same data has already been stored.
    4. Replace the duplicate data with a reference to the object already in the database.
    By the above process, cloud is not removing the data but it is placing most updated data and cleaning out old one.Which seems reasonable.
     
    • Informative Informative x 1
  5. kalaivani M

    kalaivani M Apprentice

    Engineering Discipline:
    Computer Science
     
  6. kalaivani M

    kalaivani M Apprentice

    Engineering Discipline:
    Computer Science
    thanks for ur reply kumar,

    actually i know the concept of deduplication. the thing is if it replaces the second similar file by pointer. then there will be no similar storage of the original data. just pointer will be there to refer the original one.

    if the original copy gets damaged by natural disaster / compromised by hacker.
    then we are left under risk because we will just be having only the pointer that will be pointing the currently corrupted file.

    so here cloud's availability gets failed.

    then what is the reason so special to consider this deduplication as important??
     
  7. Gollapinni Karthik Sharma

    Gollapinni Karthik Sharma Enthusiast

    Engineering Discipline:
    Computer Science
    • Like Like x 1
  8. kalaivani M

    kalaivani M Apprentice

    Engineering Discipline:
    Computer Science
    thanks for ur idea karhtik,

    i have read those contents.

    but my doubt is cloud availability will get decrease or not ??

    due to this deduplication..
     
  9. Gollapinni Karthik Sharma

    Gollapinni Karthik Sharma Enthusiast

    Engineering Discipline:
    Computer Science
    @kalaivani M : Yes cloud availability will obviously increase. These make chances to increase the availability always.

    Deduplication is an another concept of the cloud. With these back up techniques and all they make the cloud availability more often and increase the chances to make it secure also.

    They use the techniques of Raid 0 to 6 which are the fastest data copying techniques.
     
  10. kalaivani M

    kalaivani M Apprentice

    Engineering Discipline:
    Computer Science
    oh..! well..i got some idea, thanks for your help karthik. thank you CE..!
     

Share This Page