• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here: Home » What is Semi-Structured Data?

By Abhishek Ghosh April 30, 2020 8:21 pm Updated on April 30, 2020

What is Semi-Structured Data?

Advertisement

Semi-structured data is the name given in database research (informatics) to information which is not subject to a general structure, but which carry with it part of the structural information. While structured data retention must be based on a database model that contains the appearance of the data elements (objects), semi-structured data lacks one. Semi-structured data does not need to be subjected to a type model; thus, a data collection from semi-structured data can expand as desired. Semi-structured data can be brought into a form with the help of rules, which has the characteristics (1) The data collection consists of one or more sequences of objects.(2) Objects can either be decomposed into attributes (complex objects) or they are atomic objects.
(3) Atomic objects contain values of a known, elementary data type. Semi-structured data with properties (1), (2), and (3) are called well-formed semi-structured data. The Object Exchange Model (OE model) has become a de facto model for semi-structured data. Data that has these properties can also be described as well-formed XML documents.

 

Is semi-structured not also structured?

 

Semi-structured data cannot be placed in a structured database model except for one exception described below. However, there are procedures in place to detect data types of semi-structured data. If the data types (classes) and thus the relations are known, you have an entity-relationship model. However, for this model, it can only be filled with data in this structure, not with other semi-structured data. For semi-structured files shaped in an OE model, it can also be claimed that the formal description of an OE model allows you to create a matching, structured data model that can look like this:

  • This data model contains only three basic types: the nodes that represent the objects, the edges, attributes, or references, and sheets that represent the properties of the reference.
  • Thus, all semi-structured objects of an OEM model can also be written into this data model. The following is an OEM DB model.
  • Semi-structured data cannot be written into a DB model except models that have only one abstract data type for all objects.

 

Notation of semi-structured data

 

The notation of semi-structured data with XML, which has been standardized by the W3 consortium, is very widespread. This serves as a data exchange format on the Internet and is additionally used as a data storage format in many applications.In XML, attributes can be noted with the following notation for so-called elements whose name can be freely set:

Advertisement

---

Vim
1
<element [attribut_1="wert_1"] [attribut_2="wert_2"] [attribut_n="wert_n"]> content1 <unterelement_1/> <unterelement_2/> .... </element>

There are two ways to specify properties of objects within XML – (1) by XML attributes (2) by sub-elements

The so-called ssd (semi-structured-data) notation is less well known than the XML. However, this notation for semi-structured data provides a very short and clear presentation. There is another notation for the XML documents, which is called DTD (Document Type Definition). This notation describes the structure of an XML document.
XML files with DTD are more structured than XML files without DTD. XML files without DTD have no typing. Within an XML document, elements or tags and their attributes can be defined as desired, without any restrictions. In principle, it is possible for the DTD to define only a portion of the elements within the XML document. With the help of a DTD, it is possible to define which elements may exist and which attributes these elements may or must have; the amount of possible values can also be limited. In addition, the set of possible children can be defined with DTDs. The types described in the DTD can be implied. Although the XML document is subject to an object description, structured data cannot be spoken of.

Despite the possibility of further structuring with DTDs, we are still at the semi-structured level of data storage. This is due to the fact that structured data is technically subject to a so-called data dictionary, which describes the structure of the data. The structure of the entities includes the relationships, attributes, and values with their data types. It is not possible to access the stored data without the data dictionary. It is different for semi-structured data, which is basically structured like a text file. Also, the values of the attributes are not defined with data structure specifications such as string, integer, float, date, number, etc., but are generally represented as strings. Thus, an XML file validated with a DTD can be edited and modified independently of the DTD. Different XML files, which in turn can be validated with the same DTD, thus belong to the same equivalence class.

Since the structure of the DTD is derived from the processing algorithms, semi-structured data in XML with DTD can only be generated by a program in one version and further processed with a program and a version – unless semantically oriented queries or processing methods are used in further processing. DTDs may also be created by type recognition methods, such as simulation, because this method detects types of objects “classes”. Program changes, as seen here in the analysis system, also lead to the adaptation of the DTD. In addition, the semi-structured concept offers the possibility that elements that describe words and sentence phrases in this case can sequence each other at will. DTD notation provides parameters entities that allow any order and number of sub-elements of a parent. This is not possible directly with structured ER modeling.

What is Semi-Structured Data

JSON is an open standard format that uses human-readable text to convey data objects in attribute–value pairs, primarily to transmit data between a server and web application, or an alternative to XML. JSON has been popularized by web services utilizing REST. Databases such as MongoDB and Couchbase can store data natively in JSON format for semi-structured data.

 

Advantages and disadvantages of semi-structured data

 

Programmers can avoid worrying about object-relational impedance mismatch. Nested or hierarchical data simplifies data models in many situations. Support for lists of objects simplifies data models. But, the method of data storage is prone to garbage in, garbage out by removing restraints.

Tagged With semi-structured data , 2 Semi structured , semi structured - data type has elements of standardized and predictable , semi structured data

This Article Has Been Shared 223 Times!

Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to What is Semi-Structured Data?

  • Installing Local Data Lake on Ubuntu Server : Part 1

    Here is Part 1 of Installing Local Data Lake on Ubuntu Server With Hadoop, Spark, Thriftserver, Jupyter etc To Build a Prediction System.

  • Install Apache Mahout : Ubuntu 16.04 For Machine Learning Dev

    Here Is How To Install Apache Mahout On Ubuntu 16.04 For Machine Learning Development. We Can Install & Integrate Mahout With Spark, Hadoop.

  • Chart, Data Visualization in WordPress Posts From SQL & SQL Queries

    Displaying SQL result data may be a need. Here is How to Get Chart, Data Visualization in WordPress Posts From SQL Queries in Easy Way.

  • How To Install Apache Maven on Ubuntu Server

    Apache Maven is a Build Automation Tool. Here Are the Steps on How To Install Apache Maven on Ubuntu Server. Maven Needed For Many Big Data Software.

  • Influence of Digitalization on Marketing : Part III

    In the previous part of this series, under the sub-header “Marketing strategies”, we have discussed about Customer requirement and Customer acquisition. For many companies, online marketing is the most used medium for customer acquisition. The other points to discuss under the sub-header “Marketing strategies” are Customer loyalty and The challenge for marketing. “Digitization in the […]

Additionally, performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • What Online Casinos Have No Deposit Bonus in Australia March 30, 2023
  • Four Foolproof Tips To Never Run Out Of Blog Ideas For Your Website March 28, 2023
  • The Interactive Entertainment Serving as a Tech Proving Ground March 28, 2023
  • Is it Good to Run Apache Web server and MySQL Database on Separate Cloud Servers? March 27, 2023
  • Advantages of Cloud Server Over Dedicated Server for Hosting WordPress March 26, 2023

About This Article

Cite this article as: Abhishek Ghosh, "What is Semi-Structured Data?," in The Customize Windows, April 30, 2020, March 30, 2023, https://thecustomizewindows.com/2020/04/what-is-semi-structured-data/.

Source:The Customize Windows, JiMA.in

PC users can consult Corrine Chorney for Security.

Want to know more about us? Read Notability and Mentions & Our Setup.

Copyright © 2023 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Do not sell my personal information.
Cookie SettingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT