• Home
  • About Us
  • Contact Us
  • Privacy Policy
  • Special Offers
Business Intelligence Info
  • Business Intelligence
    • BI News and Info
    • Big Data
    • Mobile and Cloud
    • Self-Service BI
  • CRM
    • CRM News and Info
    • InfusionSoft
    • Microsoft Dynamics CRM
    • NetSuite
    • OnContact
    • Salesforce
    • Workbooks
  • Data Mining
    • Pentaho
    • Sisense
    • Tableau
    • TIBCO Spotfire
  • Data Warehousing
    • DWH News and Info
    • IBM DB2
    • Microsoft SQL Server
    • Oracle
    • Teradata
  • Predictive Analytics
    • FICO
    • KNIME
    • Mathematica
    • Matlab
    • Minitab
    • RapidMiner
    • Revolution
    • SAP
    • SAS/SPSS
  • Humor

Data Transformation in Practice: 3 Real-World Data Transformation Examples

November 21, 2018   Big Data
Data Transformation in Practice 3 Real World Data Transformation Examples Data Transformation in Practice: 3 Real World Data Transformation Examples
Christopher Tozzi avatar 1476151897 54x54 Data Transformation in Practice: 3 Real World Data Transformation Examples

Christopher Tozzi

November 20, 2018

Data transformation is a buzzword you hear frequently in the age of big data (even though data transformation’s significance is not limited just to big data). And while it’s easy to define data transformation at a high level, understanding what data transformation means in practice can be trickier.

If you’ve found yourself pondering what data transformation actually entails, keep reading for some real-world situations in which data needs to be transformed, and what the transformation requires.

What is data transformation?

As the term implies, data transformation means taking data stored in one format and converting it to another.

As a computer end-user, you probably perform basic data transformations on a routine basis. When you convert a Microsoft Word file to a PDF, for example, you are transforming data.

But data transformation plays a more important and sophisticated role in big data analytics. That is because when you are dealing with large volumes of data, different types of data analytics tools and different data storage systems, you are likely to encounter situations where a large amount of data needs to be transformed from one format to another.

So, that’s the high-level definition of data transformation. To illustrate the concept more precisely, let’s take a look at some real-world examples of data transformation.

The New Rules for Your Data Landscape Data Transformation in Practice: 3 Real World Data Transformation Examples

Character encoding and data transformation

Character encoding problems are a common reason for data transformation. To understand why, you first have to understand the point of character encoding.

Character encoding lets computers represent individual letters (or other characters) based on codes. In other words, every letter in the alphabet (as well as other text symbols, such as commas or periods) has can be represented as a code. As long as your computer knows which particular code was used to represent letters in a given set of data, it can decode the data in order to translate it into letters and numbers and print them on your screen.

The problem that often arises out of character encoding is that sometimes, one application encodes characters using one type of encoding scheme, then sends that information to another application that uses a different encoding scheme by default. When the second application tries to open the data, it may not be able to interpret all of the characters.

If you have ever opened a file and found that some of the letters or numbers inside text are represented as gibberish or seemingly random symbols, there is a decent chance than character encoding inconsistency is the reason.

Today, most computers following the UTF-8 encoding scheme, or a newer scheme that is backwards-compatible with it, in order to prevent encoding issues. But it still happens that an application encodes data in a way that other applications or computers do not expect. In these cases, the data would need to be transformed from one type of character encoding format to another.

CSV to XML data transformation

CSV (short for comma-separated values) and XML (or extensible markup language) are two popular ways of storing data today. But they work quite differently.

In a CSV file, you use commas to distinguish different data values from each other. In XML, you primarily represent data using tags that define different units of data and their values. Both types of files also make it possible to represent data hierarchies, but they do so in different ways.

Because CSV and XML work so differently, an application that was designed to read and write data in CSV format can’t typically open an XML file, and vice-versa.

This is where data transformation comes in. Using a data transformation tool, you can automatically convert data from a CSV file into XML format so that you can open it with the right tools.

Speech-to-text and data transformation

A third data transformation example — and one that highlights just how broadly you should think about data transformation — is a situation in which you have human speech that is stored in an audio file, but want to translate that information into a text file.

While you could technically open an audio file as a text file (by, for example, running a command like “cat myfile.wav” in your terminal on Linux or macOS), the audio file would not make much sense when opened this way. It would be a bunch of gibberish.

To make the audio data actually usable to humans who can’t listen to it — or to transform it into a format that allows it to be parsed automatically by an application that reads text — you would want to transform the speech in the audio file into a text file.

You could do this manually by listening to the audio and transcribing the speech in it. Or you could use a speech-to-text tool to automate the process. If you were transforming data on a large scale, you would want to take the automatic approach.

This example might not be one of the first to come to mind when data transformation specialists think of data transformation examples, because it involves more than dealing with data formatting differences. But it is an example of data transformation in a broad sense. It’s also a scenario that you may well face if, for example, you record phone calls with customers and want a way of making data from the conversations available for parsing by tools that can only interpret textual information.

Make sure to download our eBook, “The New Rules for Your Data Landscape“, and take a look at the rules that are transforming the relationship between business and IT.

Let’s block ads! (Why?)

Syncsort Blog

data, Examples, Practice, Realworld, Transformation
  • Recent Posts

    • The Easier Way For Banks To Handle Data Security While Working Remotely
    • 3 Ways Data Virtualization is Evolving to Meet Market Demands
    • Did you find everything you need today?
    • Missing Form Editor through command bar in Microsoft Dynamics 365
    • I’m So Excited
  • Categories

  • Archives

    • April 2021
    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • March 2020
    • February 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • August 2019
    • July 2019
    • June 2019
    • May 2019
    • April 2019
    • March 2019
    • February 2019
    • January 2019
    • December 2018
    • November 2018
    • October 2018
    • September 2018
    • August 2018
    • July 2018
    • June 2018
    • May 2018
    • April 2018
    • March 2018
    • February 2018
    • January 2018
    • December 2017
    • November 2017
    • October 2017
    • September 2017
    • August 2017
    • July 2017
    • June 2017
    • May 2017
    • April 2017
    • March 2017
    • February 2017
    • January 2017
    • December 2016
    • November 2016
    • October 2016
    • September 2016
    • August 2016
    • July 2016
    • June 2016
    • May 2016
    • April 2016
    • March 2016
    • February 2016
    • January 2016
    • December 2015
    • November 2015
    • October 2015
    • September 2015
    • August 2015
    • July 2015
    • June 2015
    • May 2015
    • April 2015
    • March 2015
    • February 2015
    • January 2015
    • December 2014
    • November 2014
© 2021 Business Intelligence Info
Power BI Training | G Com Solutions Limited