fbpx
Wikipedia

Object storage

Object storage (also known as object-based storage[1] or blob storage) is a computer data storage approach that manages data as "blobs" or "objects", as opposed to other storage architectures like file systems which manages data as a file hierarchy, and block storage which manages data as blocks within sectors and tracks.[2] Each object is typically associated with a variable amount of metadata, and a globally unique identifier. Object storage can be implemented at multiple levels, including the device level (object-storage device), the system level, and the interface level. In each case, object storage seeks to enable capabilities not addressed by other storage architectures, like interfaces that are directly programmable by the application, a namespace that can span multiple instances of physical hardware, and data-management functions like data replication and data distribution at object-level granularity.

Object storage systems allow retention of massive amounts of unstructured data in which data is written once and read once (or many times).[3] Object storage is used for purposes such as storing objects like videos and photos on Facebook, songs on Spotify, or files in online collaboration services, such as Dropbox.[4] One of the limitations with object storage is that it is not intended for transactional data, as object storage was not designed to replace NAS file access and sharing; it does not support the locking and sharing mechanisms needed to maintain a single, accurately updated version of a file.[3]

History edit

Origins edit

Jim Starkey coined the term "blob"[when?] working at Digital Equipment Corporation to refer to opaque data entities. The terminology was adopted for Rdb/VMS. "Blob" is often humorously explained to be an abbreviation for "binary large object". According to Starkey, this backronym arose when Terry McKiever, working in marketing at Apollo Computer felt that the term needed to be an abbreviation. McKiever began using the expansion "Basic Large Object". This was later eclipsed by the retroactive explanation of blobs as "Binary Large Objects". According to Starkey, "Blob don't stand for nothin'." Rejecting the acronym, he explained his motivation behind the coinage, saying, "A blob is the thing that ate Cincinnatti [sic], Cleveland, or whatever," referring to the 1958 science fiction film The Blob.[5]

In 1995, research led by Garth Gibson on Network-Attached Secure Disks first promoted the concept of splitting less common operations, like namespace manipulations, from common operations, like reads and writes, to optimize the performance and scale of both.[6] In the same year, a Belgian company - FilePool - was established to build the basis for archiving functions. Object storage was proposed at Gibson's Carnegie Mellon University lab as a research project in 1996.[7] Another key concept was abstracting the writes and reads of data to more flexible data containers (objects). Fine grained access control through object storage architecture[8] was further described by one of the NASD team, Howard Gobioff, who later was one of the inventors of the Google File System.[9]

Other related work includes the Coda filesystem project at Carnegie Mellon, which started in 1987, and spawned the Lustre file system.[10] There is also the OceanStore project at UC Berkeley,[11] which started in 1999[12] and the Logistical Networking project at the University of Tennessee Knoxville, which started in 1998.[13] In 1999, Gibson founded Panasas to commercialize the concepts developed by the NASD team.

Development edit

Seagate Technology played a central role in the development of object storage. According to the Storage Networking Industry Association SNIA, "Object storage originated in the late 1990s: Seagate specifications from 1999 Introduced some of the first commands and how operating system effectively removed from consumption of the storage."[14]

A preliminary version of the "OBJECT BASED STORAGE DEVICES Command Set Proposal" dated 10/25/1999 was submitted by Seagate as edited by Seagate's Dave Anderson and was the product of work by the National Storage Industry Consortium (NSIC) including contributions by Carnegie Mellon University, Seagate, IBM, Quantum, and StorageTek.[15] This paper was proposed to INCITS T-10 (International Committee for Information Technology Standards) with a goal to form a committee and design a specification based on the SCSI interface protocol.  This defined objects as abstracted data, with unique identifiers and metadata, how objects related to file systems, along with many other innovative concepts. Anderson presented many of these ideas at the SNIA conference in October 1999.  The presentation revealed an IP Agreement that had been signed in February 1997 between the original collaborators (with Seagate represented by Anderson and Chris Malakapalli) and covered the benefits of object storage, scalable computing, platform independence, and storage management.[16]

Architecture edit

 

Abstraction of storage edit

One of the design principles of object storage is to abstract some of the lower layers of storage away from the administrators and applications. Thus, data is exposed and managed as objects instead of blocks or (exclusively) files. Objects contain additional descriptive properties which can be used for better indexing or management. Administrators do not have to perform lower-level storage functions like constructing and managing logical volumes to utilize disk capacity or setting RAID levels to deal with disk failure.

Object storage also allows the addressing and identification of individual objects by more than just file name and file path. Object storage adds a unique identifier within a bucket, or across the entire system, to support much larger namespaces and eliminate name collisions.

Inclusion of rich custom metadata within the object edit

Object storage explicitly separates file metadata from data to support additional capabilities. As opposed to fixed metadata in file systems (filename, creation date, type, etc.), object storage provides for full function, custom, object-level metadata in order to:

  • Capture application-specific or user-specific information for better indexing purposes
  • Support data-management policies (e.g. a policy to drive object movement from one storage tier to another)
  • Centralize management of storage across many individual nodes and clusters
  • Optimize metadata storage (e.g. encapsulated, database or key value storage) and caching/indexing (when authoritative metadata is encapsulated with the metadata inside the object) independently from the data storage (e.g. unstructured binary storage)

Additionally, in some object-based file-system implementations:

  • The file system clients only contact metadata servers once when the file is opened and then get content directly via object-storage servers (vs. block-based file systems which would require constant metadata access)
  • Data objects can be configured on a per-file basis to allow adaptive stripe width, even across multiple object-storage servers, supporting optimizations in bandwidth and I/O

Object-based storage devices (OSD) as well as some software implementations (e.g., DataCore Swarm) manage metadata and data at the storage device level:

  • Instead of providing a block-oriented interface that reads and writes fixed sized blocks of data, data is organized into flexible-sized data containers, called objects
  • Each object has both data (an uninterpreted sequence of bytes) and metadata (an extensible set of attributes describing the object); physically encapsulating both together benefits recoverability.
  • The command interface includes commands to create and delete objects, write bytes and read bytes to and from individual objects, and to set and get attributes on objects
  • Security mechanisms provide per-object and per-command access control

Programmatic data management edit

Object storage provides programmatic interfaces to allow applications to manipulate data. At the base level, this includes Create, read, update and delete (CRUD) functions for basic read, write and delete operations. Some object storage implementations go further, supporting additional functionality like object/file versioning, object replication, life-cycle management and movement of objects between different tiers and types of storage. Most API implementations are REST-based, allowing the use of many standard HTTP calls.

Implementation edit

Cloud storage edit

The vast majority of cloud storage available in the market leverages an object-storage architecture. Some notable examples are Amazon Web Services S3, which debuted in March 2006, Microsoft Azure Blob Storage, Rackspace Cloud Files (whose code was donated in 2010 to Openstack project and released as OpenStack Swift), and Google Cloud Storage released in May 2010.

Object-based file systems edit

Some distributed file systems use an object-based architecture, where file metadata is stored in metadata servers and file data is stored in object storage servers. File system client software interacts with the distinct servers, and abstracts them to present a full file system to users and applications.

Object-storage systems edit

Some early incarnations of object storage were used for archiving, as implementations were optimized for data services like immutability, not performance. EMC Centera and Hitachi HCP (formerly known as HCAP) are two commonly cited object storage products for archiving. Another example is Quantum ActiveScale Object Storage Platform.

More general-purpose object-storage systems came to market around 2008. Lured by the incredible growth of "captive" storage systems within web applications like Yahoo Mail and the early success of cloud storage, object-storage systems promised the scale and capabilities of cloud storage, with the ability to deploy the system within an enterprise, or at an aspiring cloud-storage service provider.

Unified file and object storage edit

A few object-storage systems support Unified File and Object storage, allowing clients to store objects on a storage system while simultaneously other clients store files on the same storage system.[17] Other vendors in the area of Hybrid cloud storage are using Cloud storage gateways to provide a file access layer over object storage, implementing file access protocols such as SMB and NFS.

"Captive" object storage edit

Some large Internet companies developed their own software when object-storage products were not commercially available or use cases were very specific. Facebook famously invented their own object-storage software, code-named Haystack, to address their particular massive-scale photo management needs efficiently.[18]

Object-based storage devices edit

Object storage at the protocol and device layer was proposed 20 years ago[ambiguous] and approved for the SCSI command set nearly 10 years ago[ambiguous] as "Object-based Storage Device Commands" (OSD),[19] however, it had not been put into production until the development of the Seagate Kinetic Open Storage platform.[20][21] The SCSI command set for Object Storage Devices was developed by a working group of the SNIA for the T10 committee of the International Committee for Information Technology Standards (INCITS).[22] T10 is responsible for all SCSI standards.

Market adoption edit

One of the first object-storage products, Lustre, is used in 70% of the Top 100 supercomputers and ~50% of the Top 500.[23] As of June 16, 2013, this includes 7 of the top 10, including the current fourth fastest system on the list - China's Tianhe-2 and the seventh fastest, the Titan supercomputer at the Oak Ridge National Laboratory.[24]

Object-storage systems had good adoption in the early 2000s as an archive platform, particularly in the wake of compliance laws like Sarbanes-Oxley. After five years in the market, EMC's Centera product claimed over 3,500 customers and 150 petabytes shipped by 2007.[25] Hitachi's HCP product also claims many petabyte-scale customers.[26] Newer object storage systems have also gotten some traction, particularly around very large custom applications like eBay's auction site, where EMC Atmos is used to manage over 500 million objects a day.[27] As of March 3, 2014, EMC claims to have sold over 1.5 exabytes of Atmos storage.[28] On July 1, 2014, Los Alamos National Lab chose the Scality RING as the basis for a 500-petabyte storage environment, which would be among the largest ever.[29]

"Captive" object storage systems like Facebook's Haystack have scaled impressively. In April 2009, Haystack was managing 60 billion photos and 1.5 petabytes of storage, adding 220 million photos and 25 terabytes a week.[18] Facebook more recently stated that they were adding 350 million photos a day and were storing 240 billion photos.[30] This could equal as much as 357 petabytes.[31]

Cloud storage has become pervasive as many new web and mobile applications choose it as a common way to store binary data.[32] As the storage back-end to many popular applications like Smugmug and Dropbox, Amazon S3 has grown to massive scale, citing over 2-trillion objects stored in April 2013.[33] Two months later, Microsoft claimed that they stored even more objects in Azure at 8.5 trillion.[34] By April 2014, Azure claimed over 20-trillion objects stored.[35] Windows Azure Storage manages Blobs (user files), Tables (structured storage), and Queues (message delivery) and counts them all as objects.[36]

Market analysis edit

IDC has begun to assess the object-based-storage market annually using its MarketScape methodology. IDC describes the MarketScape as: "...a quantitative and qualitative assessment of the characteristics that assess a vendor's current and future success in the said market or market segment and provide a measure of their ascendancy to become a Leader or maintain a leadership. IDC MarketScape assessments are particularly helpful in emerging markets that are often fragmented, have several players, and lack clear leaders."[37]

In 2019, IDC rated Dell EMC, Hitachi Data Systems, IBM, NetApp, and Scality as leaders.

Standards edit

Object-based storage device standards edit

OSD version 1 edit

In the first version of the OSD standard,[38] objects are specified with a 64-bit partition ID and a 64-bit object ID. Partitions are created and deleted within an OSD, and objects are created and deleted within partitions. There are no fixed sizes associated with partitions or objects; they are allowed to grow subject to physical size limitations of the device or logical quota constraints on a partition.

An extensible set of attributes describe objects. Some attributes are implemented directly by the OSD, such as the number of bytes in an object and the modification time of an object. There is a special policy tag attribute that is part of the security mechanism. Other attributes are uninterpreted by the OSD. These are set on objects by the higher-level storage systems that use the OSD for persistent storage. For example, attributes might be used to classify objects, or to capture relationships among different objects stored on different OSDs.

A list command returns a list of identifiers for objects within a partition, optionally filtered by matches against their attribute values. A list command can also return selected attributes of the listed objects.

Read and write commands can be combined, or piggy-backed, with commands to get and set attributes. This ability reduces the number of times a high-level storage system has to cross the interface to the OSD, which can improve overall efficiency.

OSD version 2 edit

A second generation of the SCSI command set, "Object-Based Storage Devices - 2" (OSD-2) added support for snapshots, collections of objects, and improved error handling.[39]

A snapshot is a point-in-time copy of all the objects in a partition into a new partition. The OSD can implement a space-efficient copy using copy-on-write techniques so that the two partitions share objects that are unchanged between the snapshots, or the OSD might physically copy the data to the new partition. The standard defines clones, which are writeable, and snapshots, which are read-only.

A collection is a special kind of object that contains the identifiers of other objects. There are operations to add and delete from collections, and there are operations to get or set attributes for all the objects in a collection. Collections are also used for error reporting. If an object becomes damaged by the occurrence of a media defect (i.e., a bad spot on the disk) or by a software error within the OSD implementation, its identifier is put into a special error collection. The higher-level storage system that uses the OSD can query this collection and take corrective action as necessary.

Differences between key-value and object stores edit

The border between an object store and a key-value store is blurred, with key-value stores being sometimes loosely referred to as object stores.

A traditional block storage interface uses a series of fixed size blocks which are numbered starting at 0. Data must be that exact fixed size and can be stored in a particular block which is identified by its logical block number (LBN). Later, one can retrieve that block of data by specifying its unique LBN.

With a key-value store, data is identified by a key rather than a LBN. A key might be "cat" or "olive" or "42". It can be an arbitrary sequence of bytes of arbitrary length. Data (called a value in this parlance) does not need to be a fixed size and also can be an arbitrary sequence of bytes of arbitrary length. One stores data by presenting the key and data (value) to the data store and can later retrieve the data by presenting the key. This concept is seen in programming languages. Python calls them dictionaries, Perl calls them hashes, Java and C++ call them maps, etc. Several data stores also implement key-value stores such as Memcached, Redis and CouchDB.

Object stores are similar to key-value stores in two respects. First, the object identifier or URL (the equivalent of the key) can be an arbitrary string.[40] Second, data may be of an arbitrary size.

There are, however, a few key differences between key-value stores and object stores. First, object stores also allow one to associate a limited set of attributes (metadata) with each piece of data. The combination of a key, value, and set of attributes is referred to as an object. Second, object stores are optimized for large amounts of data (hundreds of megabytes or even gigabytes), whereas for key-value stores the value is expected to be relatively small (kilobytes). Finally, object stores usually offer weaker consistency guarantees such as eventual consistency, whereas key-value stores offer strong consistency.

See also edit

References edit

  1. ^ Mesnier, M.; Ganger, G.R.; Riedel, E. (August 2003). "Storage area networking - Object-based storage". IEEE Communications Magazine. 41 (8): 84–90. doi:10.1109/mcom.2003.1222722.
  2. ^ Porter De Leon, Yadin; Tony Piscopo (14 August 2014). "Object Storage versus Block Storage: Understanding the Technology Differences". Druva. Retrieved 19 January 2015.
  3. ^ a b Erwin, Derek (2022). "Block Storage vs. Object Storage vs. File Storage: What's the Difference?". Qumulo. Retrieved 8 February 2022. Object storage can work well for unstructured data in which data is written once and read once (or many times). Static online content, data backups, image archives, videos, pictures, and music files can be stored as objects.
  4. ^ Chandrasekaran, Arun; Dayley, Alan (11 February 2014). . Gartner Research. Archived from the original on 2014-03-16.
  5. ^ Starkey, James (1997-01-22). "The true story of BLOBs". Retrieved 2023-11-08.
  6. ^ Garth A. Gibson; Nagle D.; Amiri K.; Chan F.; Feinberg E.; Gobioff H.; Lee C.; Ozceri B.; Riedel E.; Rochberg D.; Zelenka J. "File Server Scaling with Network-Attached Secure Disks" (PDF). Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (Sigmetrics '97). Retrieved 27 October 2013.
  7. ^ Factor, Michael; Meth, K.; Naor, D.; Rodeh, O.; Satran, J. (2005). "Object Storage: The Future Building Block for Storage Systems". pp. 119–123. CiteSeerX 10.1.1.122.3959.
  8. ^ Gobioff, Howard; Gibson, Garth A.; Tygar, Doug (1 October 1997). "Security for Network Attached Storage Devices (CMU-CS-97-185)". Parallel Data Laboratory. Retrieved 7 November 2013.
  9. ^ Sanjay Ghemawat; Howard Gobioff; Shun-Tak Leung (October 2003). "The Google File System" (PDF). Retrieved 7 November 2013.
  10. ^ Braam, Peter. "Lustre: The intergalactic file system" (PDF). Retrieved 17 September 2013.
  11. ^ . Archived from the original on 8 August 2012. Retrieved 18 September 2013.
  12. ^ Kubiatowicz, John; Wells, Chris; Zhao, Ben; Bindel, David; Chen, Yan; Czerwinski, Steven; Eaton, Patrick; Geels, Dennis; Gummadi, Ramakrishna; Rhea, Sean; Weatherspoon, Hakim (2000). "OceanStore: An architecture for global-scale persistent storage". Proceedings of the ninth international conference on Architectural support for programming languages and operating systems. pp. 190–201. doi:10.1145/378993.379239. ISBN 1581133170.
  13. ^ Plank, James; Beck, Micah; Elwasif, Wael; Moore, Terry; Swany, Martin; Wolski, Rich (October 1999). "The Internet Backplane Protocol: Storage in the Network" (PDF). Netstore 1999. Retrieved 27 January 2021.
  14. ^ Object Storage: What, How and Why?, NSF (Networking Storage Forum), SNIA (Storage Networking Industry Association), Live Webcast February 19, 2020
  15. ^ Anderson, D. (1999). "Object based storage devices: a command set proposal" (PDF). T10. S2CID 59781155.
  16. ^ Object Based Storage: A Vision, slide presentation, Dave Anderson and Seagate Technology, October 13, 1999 https://www.t10.org/ftp/t10/document.99/99-341r0.pdf
  17. ^ Pritchard, Stephen (23 October 2020). "Unified file and object storage: The best of both worlds?". Computer Weekly.
  18. ^ a b Vajgel, Peter (30 April 2009). "Needle in a haystack: efficient storage of billions of photos". Retrieved 5 October 2021.
  19. ^ Riedel, Erik; Sami Iren (February 2007). "Object Storage and Applications" (PDF). Retrieved 3 November 2013.
  20. ^ "The Seagate Kinetic Open Storage Vision". Seagate. Retrieved 3 November 2013.
  21. ^ Gallagher, Sean (27 October 2013). "Seagate introduces a new drive interface: Ethernet". Ars Technica. Retrieved 3 November 2013.
  22. ^ Corbet, Jonathan (4 November 2008). "Linux and object storage devices". LWN.net. Retrieved 8 November 2013.
  23. ^ Dilger, Andreas. (PDF). IEEE MSST. Archived from the original (PDF) on 29 October 2013. Retrieved 27 October 2013.
  24. ^ . Archived from the original on 29 October 2013. Retrieved 27 October 2013.
  25. ^ "EMC Marks Five Years of EMC Centera Innovation and Market Leadership". EMC. 18 April 2007. Retrieved 3 November 2013.
  26. ^ . Techvalidate.com. Archived from the original on 24 September 2015. Retrieved 19 September 2013.
  27. ^ Robb, Drew (11 May 2011). "EMC World Continues Focus on Big Data, Cloud and Flash". Infostor. Retrieved 19 September 2013.
  28. ^ Hamilton, George. . Archived from the original on 15 March 2014. Retrieved 15 March 2014.
  29. ^ Mellor, Chris (1 July 2014). "Los Alamos National Laboratory likes it, puts Scality's RING on it". The Register. Retrieved 26 January 2015.
  30. ^ Miller, Rich (13 January 2013). "Facebook Builds Exabyte Data Centers for Cold Storage". Datacenterknowledge.com. Retrieved 6 November 2013.
  31. ^ Leung, Leo (17 May 2014). . Techexpectations.org. Archived from the original on 22 May 2014. Retrieved 23 May 2014.
  32. ^ Leung, Leo (January 11, 2012). . Archived from the original on 29 September 2013. Retrieved 27 October 2013.
  33. ^ Harris, Derrick (18 April 2013). "Amazon S3 goes exponential, now stores 2 trillion objects". Gigaom. Retrieved 17 September 2013.
  34. ^ Wilhelm, Alex (27 June 2013). "Microsoft: Azure powers 299M Skype users, 50M Office Web Apps users, stores 8.5T objects". thenextweb.com. Retrieved 18 September 2013.
  35. ^ Nelson, Fritz (4 April 2014). . Tom's IT Pro. Archived from the original on 6 May 2014. Retrieved 3 September 2014.
  36. ^ Calder, Brad. "Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency" (PDF). SOSP '11: Proceedings of the Twenty-third ACM SIGOPS Symposium on Operating Systems Principles. Association for Computing Machinery. ISBN 978-1-4503-0977-6. Retrieved 6 November 2013.
  37. ^ Potnis, Amita. "IDC MarketScape: Worldwide Object-Based Storage 2019 Vendor Assessment". idc.com. IDC. Retrieved 16 Feb 2020.
  38. ^ "INCITS 400-2004". InterNational Committee for Information Technology Standards. Retrieved 8 November 2013.
  39. ^ "INCITS 458-2011". InterNational Committee for Information Technology Standards. 15 March 2011. Retrieved 8 November 2013.
  40. ^ OpenStack Foundation. "Object Storage API overview". OpenStack Documentation. Retrieved 9 June 2017.

object, storage, also, known, object, based, storage, blob, storage, computer, data, storage, approach, that, manages, data, blobs, objects, opposed, other, storage, architectures, like, file, systems, which, manages, data, file, hierarchy, block, storage, whi. Object storage also known as object based storage 1 or blob storage is a computer data storage approach that manages data as blobs or objects as opposed to other storage architectures like file systems which manages data as a file hierarchy and block storage which manages data as blocks within sectors and tracks 2 Each object is typically associated with a variable amount of metadata and a globally unique identifier Object storage can be implemented at multiple levels including the device level object storage device the system level and the interface level In each case object storage seeks to enable capabilities not addressed by other storage architectures like interfaces that are directly programmable by the application a namespace that can span multiple instances of physical hardware and data management functions like data replication and data distribution at object level granularity Object storage systems allow retention of massive amounts of unstructured data in which data is written once and read once or many times 3 Object storage is used for purposes such as storing objects like videos and photos on Facebook songs on Spotify or files in online collaboration services such as Dropbox 4 One of the limitations with object storage is that it is not intended for transactional data as object storage was not designed to replace NAS file access and sharing it does not support the locking and sharing mechanisms needed to maintain a single accurately updated version of a file 3 Contents 1 History 1 1 Origins 1 2 Development 2 Architecture 2 1 Abstraction of storage 2 2 Inclusion of rich custom metadata within the object 2 3 Programmatic data management 3 Implementation 3 1 Cloud storage 3 2 Object based file systems 3 3 Object storage systems 3 4 Unified file and object storage 3 5 Captive object storage 3 6 Object based storage devices 4 Market adoption 5 Market analysis 6 Standards 6 1 Object based storage device standards 6 1 1 OSD version 1 6 1 2 OSD version 2 7 Differences between key value and object stores 8 See also 9 ReferencesHistory editOrigins edit Jim Starkey coined the term blob when working at Digital Equipment Corporation to refer to opaque data entities The terminology was adopted for Rdb VMS Blob is often humorously explained to be an abbreviation for binary large object According to Starkey this backronym arose when Terry McKiever working in marketing at Apollo Computer felt that the term needed to be an abbreviation McKiever began using the expansion Basic Large Object This was later eclipsed by the retroactive explanation of blobs as Binary Large Objects According to Starkey Blob don t stand for nothin Rejecting the acronym he explained his motivation behind the coinage saying A blob is the thing that ate Cincinnatti sic Cleveland or whatever referring to the 1958 science fiction film The Blob 5 In 1995 research led by Garth Gibson on Network Attached Secure Disks first promoted the concept of splitting less common operations like namespace manipulations from common operations like reads and writes to optimize the performance and scale of both 6 In the same year a Belgian company FilePool was established to build the basis for archiving functions Object storage was proposed at Gibson s Carnegie Mellon University lab as a research project in 1996 7 Another key concept was abstracting the writes and reads of data to more flexible data containers objects Fine grained access control through object storage architecture 8 was further described by one of the NASD team Howard Gobioff who later was one of the inventors of the Google File System 9 Other related work includes the Coda filesystem project at Carnegie Mellon which started in 1987 and spawned the Lustre file system 10 There is also the OceanStore project at UC Berkeley 11 which started in 1999 12 and the Logistical Networking project at the University of Tennessee Knoxville which started in 1998 13 In 1999 Gibson founded Panasas to commercialize the concepts developed by the NASD team Development edit Seagate Technology played a central role in the development of object storage According to the Storage Networking Industry Association SNIA Object storage originated in the late 1990s Seagate specifications from 1999 Introduced some of the first commands and how operating system effectively removed from consumption of the storage 14 A preliminary version of the OBJECT BASED STORAGE DEVICES Command Set Proposal dated 10 25 1999 was submitted by Seagate as edited by Seagate s Dave Anderson and was the product of work by the National Storage Industry Consortium NSIC including contributions by Carnegie Mellon University Seagate IBM Quantum and StorageTek 15 This paper was proposed to INCITS T 10 International Committee for Information Technology Standards with a goal to form a committee and design a specification based on the SCSI interface protocol This defined objects as abstracted data with unique identifiers and metadata how objects related to file systems along with many other innovative concepts Anderson presented many of these ideas at the SNIA conference in October 1999 The presentation revealed an IP Agreement that had been signed in February 1997 between the original collaborators with Seagate represented by Anderson and Chris Malakapalli and covered the benefits of object storage scalable computing platform independence and storage management 16 Architecture edit nbsp Abstraction of storage edit One of the design principles of object storage is to abstract some of the lower layers of storage away from the administrators and applications Thus data is exposed and managed as objects instead of blocks or exclusively files Objects contain additional descriptive properties which can be used for better indexing or management Administrators do not have to perform lower level storage functions like constructing and managing logical volumes to utilize disk capacity or setting RAID levels to deal with disk failure Object storage also allows the addressing and identification of individual objects by more than just file name and file path Object storage adds a unique identifier within a bucket or across the entire system to support much larger namespaces and eliminate name collisions Inclusion of rich custom metadata within the object edit Object storage explicitly separates file metadata from data to support additional capabilities As opposed to fixed metadata in file systems filename creation date type etc object storage provides for full function custom object level metadata in order to Capture application specific or user specific information for better indexing purposes Support data management policies e g a policy to drive object movement from one storage tier to another Centralize management of storage across many individual nodes and clusters Optimize metadata storage e g encapsulated database or key value storage and caching indexing when authoritative metadata is encapsulated with the metadata inside the object independently from the data storage e g unstructured binary storage Additionally in some object based file system implementations The file system clients only contact metadata servers once when the file is opened and then get content directly via object storage servers vs block based file systems which would require constant metadata access Data objects can be configured on a per file basis to allow adaptive stripe width even across multiple object storage servers supporting optimizations in bandwidth and I OObject based storage devices OSD as well as some software implementations e g DataCore Swarm manage metadata and data at the storage device level Instead of providing a block oriented interface that reads and writes fixed sized blocks of data data is organized into flexible sized data containers called objects Each object has both data an uninterpreted sequence of bytes and metadata an extensible set of attributes describing the object physically encapsulating both together benefits recoverability The command interface includes commands to create and delete objects write bytes and read bytes to and from individual objects and to set and get attributes on objects Security mechanisms provide per object and per command access controlProgrammatic data management edit Object storage provides programmatic interfaces to allow applications to manipulate data At the base level this includes Create read update and delete CRUD functions for basic read write and delete operations Some object storage implementations go further supporting additional functionality like object file versioning object replication life cycle management and movement of objects between different tiers and types of storage Most API implementations are REST based allowing the use of many standard HTTP calls Implementation editCloud storage edit Main article Cloud storage The vast majority of cloud storage available in the market leverages an object storage architecture Some notable examples are Amazon Web Services S3 which debuted in March 2006 Microsoft Azure Blob Storage Rackspace Cloud Files whose code was donated in 2010 to Openstack project and released as OpenStack Swift and Google Cloud Storage released in May 2010 Object based file systems edit Some distributed file systems use an object based architecture where file metadata is stored in metadata servers and file data is stored in object storage servers File system client software interacts with the distinct servers and abstracts them to present a full file system to users and applications Object storage systems edit Some early incarnations of object storage were used for archiving as implementations were optimized for data services like immutability not performance EMC Centera and Hitachi HCP formerly known as HCAP are two commonly cited object storage products for archiving Another example is Quantum ActiveScale Object Storage Platform More general purpose object storage systems came to market around 2008 Lured by the incredible growth of captive storage systems within web applications like Yahoo Mail and the early success of cloud storage object storage systems promised the scale and capabilities of cloud storage with the ability to deploy the system within an enterprise or at an aspiring cloud storage service provider Unified file and object storage edit A few object storage systems support Unified File and Object storage allowing clients to store objects on a storage system while simultaneously other clients store files on the same storage system 17 Other vendors in the area of Hybrid cloud storage are using Cloud storage gateways to provide a file access layer over object storage implementing file access protocols such as SMB and NFS Captive object storage edit Some large Internet companies developed their own software when object storage products were not commercially available or use cases were very specific Facebook famously invented their own object storage software code named Haystack to address their particular massive scale photo management needs efficiently 18 Object based storage devices edit Object storage at the protocol and device layer was proposed 20 years ago ambiguous and approved for the SCSI command set nearly 10 years ago ambiguous as Object based Storage Device Commands OSD 19 however it had not been put into production until the development of the Seagate Kinetic Open Storage platform 20 21 The SCSI command set for Object Storage Devices was developed by a working group of the SNIA for the T10 committee of the International Committee for Information Technology Standards INCITS 22 T10 is responsible for all SCSI standards Market adoption editOne of the first object storage products Lustre is used in 70 of the Top 100 supercomputers and 50 of the Top 500 23 As of June 16 2013 this includes 7 of the top 10 including the current fourth fastest system on the list China s Tianhe 2 and the seventh fastest the Titan supercomputer at the Oak Ridge National Laboratory 24 Object storage systems had good adoption in the early 2000s as an archive platform particularly in the wake of compliance laws like Sarbanes Oxley After five years in the market EMC s Centera product claimed over 3 500 customers and 150 petabytes shipped by 2007 25 Hitachi s HCP product also claims many petabyte scale customers 26 Newer object storage systems have also gotten some traction particularly around very large custom applications like eBay s auction site where EMC Atmos is used to manage over 500 million objects a day 27 As of March 3 2014 EMC claims to have sold over 1 5 exabytes of Atmos storage 28 On July 1 2014 Los Alamos National Lab chose the Scality RING as the basis for a 500 petabyte storage environment which would be among the largest ever 29 Captive object storage systems like Facebook s Haystack have scaled impressively In April 2009 Haystack was managing 60 billion photos and 1 5 petabytes of storage adding 220 million photos and 25 terabytes a week 18 Facebook more recently stated that they were adding 350 million photos a day and were storing 240 billion photos 30 This could equal as much as 357 petabytes 31 Cloud storage has become pervasive as many new web and mobile applications choose it as a common way to store binary data 32 As the storage back end to many popular applications like Smugmug and Dropbox Amazon S3 has grown to massive scale citing over 2 trillion objects stored in April 2013 33 Two months later Microsoft claimed that they stored even more objects in Azure at 8 5 trillion 34 By April 2014 Azure claimed over 20 trillion objects stored 35 Windows Azure Storage manages Blobs user files Tables structured storage and Queues message delivery and counts them all as objects 36 Market analysis editIDC has begun to assess the object based storage market annually using its MarketScape methodology IDC describes the MarketScape as a quantitative and qualitative assessment of the characteristics that assess a vendor s current and future success in the said market or market segment and provide a measure of their ascendancy to become a Leader or maintain a leadership IDC MarketScape assessments are particularly helpful in emerging markets that are often fragmented have several players and lack clear leaders 37 In 2019 IDC rated Dell EMC Hitachi Data Systems IBM NetApp and Scality as leaders Standards editObject based storage device standards edit OSD version 1 edit In the first version of the OSD standard 38 objects are specified with a 64 bit partition ID and a 64 bit object ID Partitions are created and deleted within an OSD and objects are created and deleted within partitions There are no fixed sizes associated with partitions or objects they are allowed to grow subject to physical size limitations of the device or logical quota constraints on a partition An extensible set of attributes describe objects Some attributes are implemented directly by the OSD such as the number of bytes in an object and the modification time of an object There is a special policy tag attribute that is part of the security mechanism Other attributes are uninterpreted by the OSD These are set on objects by the higher level storage systems that use the OSD for persistent storage For example attributes might be used to classify objects or to capture relationships among different objects stored on different OSDs A list command returns a list of identifiers for objects within a partition optionally filtered by matches against their attribute values A list command can also return selected attributes of the listed objects Read and write commands can be combined or piggy backed with commands to get and set attributes This ability reduces the number of times a high level storage system has to cross the interface to the OSD which can improve overall efficiency OSD version 2 edit A second generation of the SCSI command set Object Based Storage Devices 2 OSD 2 added support for snapshots collections of objects and improved error handling 39 A snapshot is a point in time copy of all the objects in a partition into a new partition The OSD can implement a space efficient copy using copy on write techniques so that the two partitions share objects that are unchanged between the snapshots or the OSD might physically copy the data to the new partition The standard defines clones which are writeable and snapshots which are read only A collection is a special kind of object that contains the identifiers of other objects There are operations to add and delete from collections and there are operations to get or set attributes for all the objects in a collection Collections are also used for error reporting If an object becomes damaged by the occurrence of a media defect i e a bad spot on the disk or by a software error within the OSD implementation its identifier is put into a special error collection The higher level storage system that uses the OSD can query this collection and take corrective action as necessary Differences between key value and object stores editThe border between an object store and a key value store is blurred with key value stores being sometimes loosely referred to as object stores A traditional block storage interface uses a series of fixed size blocks which are numbered starting at 0 Data must be that exact fixed size and can be stored in a particular block which is identified by its logical block number LBN Later one can retrieve that block of data by specifying its unique LBN With a key value store data is identified by a key rather than a LBN A key might be cat or olive or 42 It can be an arbitrary sequence of bytes of arbitrary length Data called a value in this parlance does not need to be a fixed size and also can be an arbitrary sequence of bytes of arbitrary length One stores data by presenting the key and data value to the data store and can later retrieve the data by presenting the key This concept is seen in programming languages Python calls them dictionaries Perl calls them hashes Java and C call them maps etc Several data stores also implement key value stores such as Memcached Redis and CouchDB Object stores are similar to key value stores in two respects First the object identifier or URL the equivalent of the key can be an arbitrary string 40 Second data may be of an arbitrary size There are however a few key differences between key value stores and object stores First object stores also allow one to associate a limited set of attributes metadata with each piece of data The combination of a key value and set of attributes is referred to as an object Second object stores are optimized for large amounts of data hundreds of megabytes or even gigabytes whereas for key value stores the value is expected to be relatively small kilobytes Finally object stores usually offer weaker consistency guarantees such as eventual consistency whereas key value stores offer strong consistency See also editBlock storage File storage Cloud storage Clustered file system Object access methodReferences edit Mesnier M Ganger G R Riedel E August 2003 Storage area networking Object based storage IEEE Communications Magazine 41 8 84 90 doi 10 1109 mcom 2003 1222722 Porter De Leon Yadin Tony Piscopo 14 August 2014 Object Storage versus Block Storage Understanding the Technology Differences Druva Retrieved 19 January 2015 a b Erwin Derek 2022 Block Storage vs Object Storage vs File Storage What s the Difference Qumulo Retrieved 8 February 2022 Object storage can work well for unstructured data in which data is written once and read once or many times Static online content data backups image archives videos pictures and music files can be stored as objects Chandrasekaran Arun Dayley Alan 11 February 2014 Critical Capabilities for Object Storage Gartner Research Archived from the original on 2014 03 16 Starkey James 1997 01 22 The true story of BLOBs Retrieved 2023 11 08 Garth A Gibson Nagle D Amiri K Chan F Feinberg E Gobioff H Lee C Ozceri B Riedel E Rochberg D Zelenka J File Server Scaling with Network Attached Secure Disks PDF Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems Sigmetrics 97 Retrieved 27 October 2013 Factor Michael Meth K Naor D Rodeh O Satran J 2005 Object Storage The Future Building Block for Storage Systems pp 119 123 CiteSeerX 10 1 1 122 3959 Gobioff Howard Gibson Garth A Tygar Doug 1 October 1997 Security for Network Attached Storage Devices CMU CS 97 185 Parallel Data Laboratory Retrieved 7 November 2013 Sanjay Ghemawat Howard Gobioff Shun Tak Leung October 2003 The Google File System PDF Retrieved 7 November 2013 Braam Peter Lustre The intergalactic file system PDF Retrieved 17 September 2013 OceanStore Archived from the original on 8 August 2012 Retrieved 18 September 2013 Kubiatowicz John Wells Chris Zhao Ben Bindel David Chen Yan Czerwinski Steven Eaton Patrick Geels Dennis Gummadi Ramakrishna Rhea Sean Weatherspoon Hakim 2000 OceanStore An architecture for global scale persistent storage Proceedings of the ninth international conference on Architectural support for programming languages and operating systems pp 190 201 doi 10 1145 378993 379239 ISBN 1581133170 Plank James Beck Micah Elwasif Wael Moore Terry Swany Martin Wolski Rich October 1999 The Internet Backplane Protocol Storage in the Network PDF Netstore 1999 Retrieved 27 January 2021 Object Storage What How and Why NSF Networking Storage Forum SNIA Storage Networking Industry Association Live Webcast February 19 2020 Anderson D 1999 Object based storage devices a command set proposal PDF T10 S2CID 59781155 Object Based Storage A Vision slide presentation Dave Anderson and Seagate Technology October 13 1999 https www t10 org ftp t10 document 99 99 341r0 pdf Pritchard Stephen 23 October 2020 Unified file and object storage The best of both worlds Computer Weekly a b Vajgel Peter 30 April 2009 Needle in a haystack efficient storage of billions of photos Retrieved 5 October 2021 Riedel Erik Sami Iren February 2007 Object Storage and Applications PDF Retrieved 3 November 2013 The Seagate Kinetic Open Storage Vision Seagate Retrieved 3 November 2013 Gallagher Sean 27 October 2013 Seagate introduces a new drive interface Ethernet Ars Technica Retrieved 3 November 2013 Corbet Jonathan 4 November 2008 Linux and object storage devices LWN net Retrieved 8 November 2013 Dilger Andreas Lustre Future Development PDF IEEE MSST Archived from the original PDF on 29 October 2013 Retrieved 27 October 2013 Datadirect Networks to build world s fastest storage system for Titan the world s most powerful supercomputer Archived from the original on 29 October 2013 Retrieved 27 October 2013 EMC Marks Five Years of EMC Centera Innovation and Market Leadership EMC 18 April 2007 Retrieved 3 November 2013 Hitachi Content Platform Supports Multiple Petabytes Billions of Objects Techvalidate com Archived from the original on 24 September 2015 Retrieved 19 September 2013 Robb Drew 11 May 2011 EMC World Continues Focus on Big Data Cloud and Flash Infostor Retrieved 19 September 2013 Hamilton George In it for the Long Run EMC s Object Storage Leadership Archived from the original on 15 March 2014 Retrieved 15 March 2014 Mellor Chris 1 July 2014 Los Alamos National Laboratory likes it puts Scality s RING on it The Register Retrieved 26 January 2015 Miller Rich 13 January 2013 Facebook Builds Exabyte Data Centers for Cold Storage Datacenterknowledge com Retrieved 6 November 2013 Leung Leo 17 May 2014 How much data does x store Techexpectations org Archived from the original on 22 May 2014 Retrieved 23 May 2014 Leung Leo January 11 2012 Object storage already dominates our days we just didn t notice Archived from the original on 29 September 2013 Retrieved 27 October 2013 Harris Derrick 18 April 2013 Amazon S3 goes exponential now stores 2 trillion objects Gigaom Retrieved 17 September 2013 Wilhelm Alex 27 June 2013 Microsoft Azure powers 299M Skype users 50M Office Web Apps users stores 8 5T objects thenextweb com Retrieved 18 September 2013 Nelson Fritz 4 April 2014 Microsoft Azure s 44 New Enhancements 20 Trillion Objects Tom s IT Pro Archived from the original on 6 May 2014 Retrieved 3 September 2014 Calder Brad Windows Azure Storage A Highly Available Cloud Storage Service with Strong Consistency PDF SOSP 11 Proceedings of the Twenty third ACM SIGOPS Symposium on Operating Systems Principles Association for Computing Machinery ISBN 978 1 4503 0977 6 Retrieved 6 November 2013 Potnis Amita IDC MarketScape Worldwide Object Based Storage 2019 Vendor Assessment idc com IDC Retrieved 16 Feb 2020 INCITS 400 2004 InterNational Committee for Information Technology Standards Retrieved 8 November 2013 INCITS 458 2011 InterNational Committee for Information Technology Standards 15 March 2011 Retrieved 8 November 2013 OpenStack Foundation Object Storage API overview OpenStack Documentation Retrieved 9 June 2017 Retrieved from https en wikipedia org w index php title Object storage amp oldid 1198074202, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.