
Volume 7, Issue 8, August – 2022 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165

Cause and Effect of Data Migration in Cloud Computing
Sourav Kumar Upadhyay1, Dr. Prakash Kumar2
1 Assistant Professor, Department of CA & CS, JRSU, Ranchi, India
2 Assistant Professor (HOD), Department of CA & CS, JRSU, Ranchi, India

Abstract:- Cloud data migration involves data warehouses that take in flat files from many sources and are used for data analysis and validation. In this paper, we describe the data warehouse and its types, the types of migration, the causes of and considerations for data migration in the cloud, and the effects and possible risks of data migration in the cloud.

Keywords:- Cloud Migration, Data Warehouse, Data Migration in Cloud Computing, Cause and consideration of data migration in cloud, Effect and possible risks in Data Migration Cloud.

I. INTRODUCTION

An organization can migrate to the cloud in a number of ways. In one common model, data and applications are moved from a local on-premises data center to the public cloud. A cloud migration can also move data and applications from one cloud platform or provider to another, which is often known as a cloud-to-cloud transfer. A third type is reverse cloud migration, also known as cloud repatriation or cloud exit, in which data or applications are moved from the cloud back to a local data center. The source data is unstructured and kept in many places. Before validation begins, the datasets from these sources must be obtained and loaded into the staging area. After validation, the filtered data is stored in tabular form in the cloud. Cloud migration is the process of moving software, data, or other corporate assets to a cloud environment. It moves on-premises data to cloud storage and works with validated flat files from various sources.

A. Data Warehouse
A data warehouse is a repository made up of a single, very large collection of uniform data gathered from several sources. Users and clients find it convenient for market data reports and progress summaries. Data services and the repository have a role in data charges. The data warehouse environment includes a data repository, data marts, and metadata. A data mart is a portion of a data warehouse. The purpose of the data repository is shown in Figure 1. Many operations are performed on data in multidimensional space, including extraction, compression, and manipulation. Processing the data right away shortens the turnaround time. Depending on the kind of end users and the degree of data redundancy, a data warehouse is crucial. The concept of the data warehouse has been drastically altered by the fusion of many technologies. We must research and understand the needs of the business and plan an affordable and useful data repository [1].

Fig.1. Data Warehouse [2]

B. Kinds of Data Warehouse

B.1. Enterprise Data Warehouse (EDW): An EDW is an enterprise-wide warehouse that supports decision making. It provides a uniform method for representing and organizing data, which makes it easier to classify data by subject and to grant access according to those divisions.

Fig.2. Enterprise Data Warehouse [3]

B.2. Operational Data Store (ODS): An ODS is a data store used when neither an OLTP system nor a data warehouse meets the organization's reporting needs. Because it is refreshed continuously, it suits routine tasks such as storing customer or employee records.

IJISRT22AUG232 www.ijisrt.com 45
Fig.3. ODS [4]

B.3. Data Mart: A data mart is a useful subset of the data warehouse. It is built specifically for a particular line of business, such as sales or finance. A separate data mart may gather data directly from the sources or indirectly through various data processing centers.

Fig.4. Data Mart [5]

II. MIGRATION OF DATA

Data migration is the process of transferring information to a new, updated device or location. Data is gathered, scheduled, and converted so that it can be permanently transferred from one device's storage to another. Database migration services are growing in popularity as businesses place greater emphasis on technology advancement and optimization.

Fig.5. Sketch of Migration [6]

We can see from the diagram that data can be moved across different computer systems or formats. Data migration is one of the most important things to keep in mind when implementing, consolidating, or updating a system. It is frequently triggered by the introduction of a new data structure or location. Migration techniques are customized as needed, and hardware and software specifications are validated. Pre-validation testing can also be done to make sure that specifications and customizable configurations function as expected. If everything checks out, the transfer process begins. It comprises the crucial operations of data retrieval (reading data from the previous system) and data loading (writing data to the new system).

Fig.6. Extract, Transform, Load Sketch [7]

A. After Relocation
After data transfer, the results are examined to determine whether the data was correctly translated, is complete, and conforms to the procedures used by the new system. Running both systems in parallel may be necessary during verification to find areas of divergence and to prevent erroneous data loss. Once the migration is certified complete, additional documentation and analysis of the migration process are finished, and the old systems are retired. Close-out sessions bring the transfer procedure to an end.

B. Kinds of Migration
There are four main types of data migration, and each one requires sufficient planning and verification before implementation.

B.1. Database Migration: Either the database is upgraded, or the entire database is moved from one vendor to another. Databases are the foundation of the technology we use every day. It makes sense that SMBs, too, would change database providers, update their software, or move their databases to the cloud. A database migration may require transferring data between two different database engines.

B.2. Storage Migration: Storage migration describes the transfer of data from one storage medium to another. This entails physically moving data blocks from one type of hardware (such as tapes or disks) to another. Moving data from one storage medium, like a hard disk or the cloud, to another is known as data
migration. In other words, data is transferred from one storage medium to another.

B.3. Business Process Migration: Through this procedure, data, software, and other business-related items are moved from an on-premises data center to a cloud or from one cloud to another. It is focused on a company's operational procedures, particularly business management tools that are outdated or in need of replacement. It is typically triggered by a merger or acquisition.

B.4. Application Migration: Migration is necessary when the application vendor changes. It is essential because every application relies on a particular data model. It entails adapting application programs to a more modern environment, and it can move a whole application stack from on-premises IT infrastructure to the cloud or between clouds.

III. CAUSE AND CONSIDERATION OF DATA MIGRATION IN CLOUD

With the ever-increasing volume of data being produced, businesses are under growing pressure to maximize the value they derive from the data they collect. Success in this environment depends more and more on selecting the best environments for your workloads and making sure your data is stored efficiently and is easily accessible. In an effort to host their applications in the most economical and effective IT environment possible, many businesses are deciding to migrate workloads to the cloud. Planning a cloud migration should start with early consideration of the best data migration option. Data migration is also crucial when upgrading or consolidating server and storage infrastructure and when introducing data-intensive technologies such as databases, data centers, and data lakes, or large-scale virtualization programs. Additionally, data may be transferred between internal systems and cloud storage, as well as within HDD- or SSD-based systems.

A. Factors to take into account while developing a data migration plan
The better your company plans its data migration, the less likely you are to incur unforeseen costs or unplanned downtime, and the less likely your end users are to be frustrated or inconvenienced during and after the migration. Define objectives, create a schedule, and be prepared for any difficulties that may arise. When choosing your strategy for the project, you should primarily take the following three factors into account:

A.1. Nature of the work: Specialized workloads such as databases, backups, or virtual machines (VMs) can typically be moved with vendor-provided tools designed for the type of data being migrated. If you lack access to such tools, plan carefully for any potential outage. For mission-critical workloads, you can transfer data incrementally, testing along the way while keeping the source and target systems running in parallel. Alternatively, you may schedule a bulk transfer outside normal business hours (if you can complete it within the available window).

A.2. Data Volume: When migrating less than 10 terabytes (TB) of data, shipping the data to its new storage location on a client-provided storage device is typically the quickest and most affordable option. For transfers involving larger volumes, up to multiple petabytes (PB), a specialized data movement tool offered by your cloud provider may be the most practical and cost-effective alternative. Although online migration could in theory be used for any amount of data, time constraints make it questionable for very large data sets.

A.3. Speed of completion: How quickly an online migration finishes depends on how much data is being transferred and how fast your network connection is. For offline migrations, shipping time must be taken into account. If start-to-finish migration speed is your top priority and you have bandwidth to dedicate to the migration, online transfer may be the ideal option. However, if your migration date is flexible or you have bandwidth or other networking limitations, offline migration may be your best alternative [8].

B. Suggested tools
Many solutions are now available to make enterprise data migrations easier. These include both licensed and open-source tools, as well as vendor-specific solutions that cloud providers offer to help clients migrate into their public or private cloud environments. The ideal tools for your project will depend on your data migration approach. The following are some common options (Table 1):

Veeam       Provides a Quick Migration tool for VMware vSphere that speeds up and simplifies the transfer of VM-based workloads across hosts and storage environments.
Zerto       Provides a unified platform for workload mobility, disaster recovery, and backup that supports migrations of all sizes, from the relocation of a single application to an entire data centre.
Cyberduck   An open-source FTP and SFTP client that can be used to move single files or entire file volumes between systems or into the cloud.
Rclone      An open-source command-line tool for moving data to and from cloud object storage. Large objects can be automatically split into segments that are uploaded in parallel.

Table.1. Tools for Data Migration [8]
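As a rough illustration of the trade-off between data volume and speed of completion discussed in A.2 and A.3, the following Python sketch estimates how long an online transfer would take and compares it with an offline shipment. It is not taken from [8]. The function names, the 80% link utilization, and the example shipping time are assumptions chosen for illustration only.

```python
SECONDS_PER_DAY = 86_400


def online_transfer_days(data_tb: float, bandwidth_mbps: float,
                         utilization: float = 0.8) -> float:
    """Estimate the days needed to send data_tb terabytes over a network link.

    bandwidth_mbps is the nominal link speed in megabits per second.
    utilization is the assumed fraction of that speed the migration can use.
    """
    total_bits = data_tb * 1e12 * 8            # decimal terabytes to bits
    effective_bps = bandwidth_mbps * 1e6 * utilization
    return total_bits / effective_bps / SECONDS_PER_DAY


def faster_option(data_tb: float, bandwidth_mbps: float,
                  shipping_days: float, utilization: float = 0.8):
    """Return ("online" or "offline", estimated days) for the quicker route.

    shipping_days is assumed to cover the whole offline route: copying the
    data to the device, transit, and loading at the destination.
    """
    online_days = online_transfer_days(data_tb, bandwidth_mbps, utilization)
    if online_days <= shipping_days:
        return "online", online_days
    return "offline", shipping_days


if __name__ == "__main__":
    for mbps in (100, 1000):
        choice, days = faster_option(data_tb=10, bandwidth_mbps=mbps,
                                     shipping_days=7)
        print(f"10 TB at {mbps} Mbit/s: {choice} ({days:.1f} days)")
```

Under these assumptions, 10 TB over a 100 Mbit/s link takes roughly 11.6 days, so a shipment that arrives within 7 days is faster, while over a 1 Gbit/s link the same transfer takes a little over one day. Real migrations would also need to account for cost, retries, and the time spent on verification.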

IV. EFFECT AND POSSIBLE RISK OF DATA MIGRATION IN CLOUD

The shift to the cloud has several advantages. Among other benefits, moving your company's activities to the cloud can save money, boost productivity, and provide improved security. If you ultimately decide to migrate to the cloud, you must carefully create a cloud migration project plan. Several cloud migration technologies will be helpful throughout this process. But if you are considering moving to the cloud, there are a few risks you should keep in mind. These include:

A. Absence of a well-defined cloud migration plan
Many organizations get caught up in the hype and rush to switch to the cloud without giving it much thought. Before diving headfirst into cloud computing, there are many things to consider, and you should develop a detailed strategy for the transition. Consider the benefits you want from moving to the cloud as well as your reasons for doing so. Think about which data you want to move to the cloud and how much of it. You might want to keep certain particularly sensitive or important data on-premises. Decide on the amount of storage you need as well as the number of potential cloud providers [9].

B. Security risks
Security risks are most likely the greatest dangers that businesses moving to cloud computing must deal with. In addition to compliance violations and contractual breaches, the security risks of moving to the cloud include insecure APIs, accidental errors, malware, and external attacks. You must be aware of these risks and be prepared to handle them before switching to the cloud.

C. Overspending
Although cloud providers' pricing structures are flexible, they are often hard to understand. This can, and sometimes does, result in up to 70% of cloud computing spending being wasted. The cost of cloud computing varies, and each provider offers a different set of services and prices. Choosing the right combination can be difficult. You risk wasting a lot of money if you do not make the calculations needed to determine exactly what you need (Figure 8).

Fig.8. Overspending in the cloud [10]

D. Undesirable delay
Increased latency is an underappreciated risk of cloud migration. Even a brief delay in your application could significantly harm your business. Delays frustrate customers and can damage the reputation of your brand. There are several potential remedies for latency problems, but if they fail or are prohibitively expensive, keeping some of your data on-premises may make sense.

E. Loss of data
There is always a chance that data will be lost when it is moved to a new storage location. You might discover that some of your files are missing, incomplete, or corrupt, whether as a result of technical problems or human error. Make sure your cloud service provider (CSP) offers options for data backup, restore, and fallback. Having your data backed up with multiple cloud providers is a good idea, so that you do not have to worry about an individual service going offline unexpectedly. It is also wise to back up all of your crucial data to a local drive.

F. Reduced control and visibility
Loss of visibility in the public cloud is a very real risk when moving to the cloud, and it can affect performance. When your data is housed on-premises, you have complete control over all of your resources, policies, and infrastructure. When you use external cloud services, some of these responsibilities are transferred to the CSP, which might reduce your business's visibility [11-19].

V. CONCLUSION

In this paper, we have studied how a data warehouse works and the ETL workflow. We have examined the concept of data migration and its types in detail in order to form an idea of what should be done to minimize data loss. We have then looked at the various risks of migrating data to a cloud platform. A move to the cloud is desirable, but one should formulate a good strategy beforehand in order to eliminate any risk that could jeopardize the migration [20-22].

REFERENCES

[1]. N Prasath, J Sreemathy; A New Approach for Cloud Data Migration Technique Using Talend ETL Tool, 7th International Conference on Advanced Computing & Communication Systems (ICACCS), 2021.
[2]. Image available at: https://www.javatpoint.com/data-warehouse-architecture
[3]. Image available at: https://blog.mirus.com/what-is-an-enterprise-data-warehouse
[4]. Image available at: https://www.javatpoint.com/data-warehouse-operational-data-stores
[5]. Image available at: https://docs.oracle.com/cd/A81042_01/DOC/server.816/a76994/marts.htm

[6]. Image available at: https://www.auratechnology.com/auranews/blog/data-migrations/
[7]. Image available at: https://blog.bismart.com/en/how-to-choose-the-right-etl-tool
[8]. https://www.ibm.com/cloud/learn/data-migration
[9]. Netanya Karni; Risk in cloud migration, available at: https://www.peerspot.com/articles/risks-in-cloud-migration#:~:text=Migrating%20to%20the%20cloud%20involves,equip%20yourself%20to%20handle%20them.
[10]. Image available at: https://cdn.ttgtmedia.com/rms/onlineImages/cloud_computing-overspending_in_the_cloud-f_desktop.png
[11]. Saranya N, Brindha R; Data migration using ETL Workflow, 7th International Conference on Advanced Computing & Communication Systems (ICACCS), 2021.
[12]. Lei Yang; QoS Guaranteed Resource Allocation for Live Virtual Machine Migration in Edge Clouds, School of Software Engineering, South China University of Technology, Guangzhou 510641, China.
[13]. Dr. Prakash Kumar, International Journal of Computer Science and Mobile Computing, Vol. 6, Issue 12, December 2017, ISSN 2320-088X, pp. 157-163.
[14]. Dr. Prakash Kumar, International Journal of Creative Research Thoughts, Vol. 6, Issue 2, April 2018, ISSN 2320-2882, pp. 428-434.
[15]. Dr. Prakash Kumar, “Security Issues in Vehicular network”, JESMT, Vol. 2, Issue 1, 2012, ISSN 2231-1521.
[16]. “Survey on Tools & Technologies used in Semantic web and IOT”, IJCRT, Vol. 6, Issue 2, June 2018, ISSN 2320-2882, pp. 302-306.
[17]. “A Novel Software development life Cycle Model for Developing Software Project”, IJCRT, Vol. 6, Issue 2, April 2018, ISSN 2320-2882, pp. 428-434.
[18]. “Security Aspects in Social Networking Model”, IJCRT, UGC Approved, Vol. 6, Issue 1, Jan 2018, ISSN 2320-2882.
[19]. “Cloud Database Services improve using load balancing Technique”, JETIR, Vol. 8, Issue 1, 2021, ISSN 2349-5162, pp. 822-826.
[20]. Sourav Kumar Upadhyay, Dr. S.C. Dutta, Dr. Prakash Kumar; Impact of the Internet Shutdown in Ranchi, Jharkhand: A Survey, 2022. Available at: http://www.jetir.org/view?paper=JETIR2207676
[21]. Purushottam Kumar, Dr. Prakash Kumar; A survey on Load balancing in Cloud Computing. Available at: http://www.jetir.org/view?paper=JETIR2207664
[22]. Sourav Kumar Upadhyay, Dr. S.C. Dutta, Dr. Prakash Kumar; A review on the risk and its countermeasures in cloud environment; JETIR, Vol. 9, Issue 8, 2022, ISSN 2349-5162.
