...------------------------------------------------- BIG Data February 8, 2015 Srinivas gogineni SAI SRAVAN KOLUKULA February 8, 2015 Srinivas gogineni SAI SRAVAN KOLUKULA Introduction Big data burst upon the scene in the first decade of the 21st century. The first organizations to embrace it were online and startup firms. Firms like Google, eBay, LinkedIn, and Facebook were built around big data from the beginning. Like many new information technologies, big data can bring about dramatic cost reductions, substantial improvements in the time required to perform a computing task, or new product and service offerings. Davenport.T (2013). Big Data is emerging from the realms of science projects at Web companies to help companies like telecommunication giants understand exactly which customers are unhappy with service and what processes caused the dissatisfaction, and predict which customers are going to change carriers. To obtain this information, billions of loosely-structured bytes of data in different locations needs to be processed until the needle in the haystack is found. The analysis enables executive management to fix faulty processes or people and maybe be able to reach out to retain the at-risk customers. The real business impact is that big data technologies can do this in weeks or months, four-or-more-times faster than traditional data warehousing approaches. Floyer.D (2015). Literature Review The IT techniques and tools to execute big data processing...
Words: 4913 - Pages: 20
...or traditional data processing applications. The challenges include capture, curation, storage,[3] search, sharing, transfer, analysis,[4] and visualization. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as compared to separate smaller sets with the same total amount of data, allowing correlations to be found to "spot business trends, determine quality of research, prevent diseases, link legal citations, combat crime, and determine real-time roadway traffic conditions."[5][6][7] As of 2012, limits on the size of data sets that are feasible to process in a reasonable amount of time were on the order of exabytes of data.[8][9] Scientists regularly encounter limitations due to large data sets in many areas, including meteorology, genomics,[10] connectomics, complex physics simulations,[11] and biological and environmental research.[12] The limitations also affect Internet search, finance andbusiness informatics. Data sets grow in size in part because they are increasingly being gathered by ubiquitous information-sensing mobile devices, aerial sensory technologies (remote sensing), software logs, cameras, microphones, radio-frequency identification readers, andwireless sensor networks.[13][14] The world's technological per-capita capacity to store information has roughly doubled every 40 months since the 1980s;[15] as of 2012, every day 2.5 quintillion (2.5×1018) bytes of data were created.[16] The...
Words: 356 - Pages: 2
...4. 4.1 Big Data Introduction In 2004, Wal-Mart claimed to have the largest data warehouse with 500 terabytes storage (equivalent to 50 printed collections of the US Library of Congress). In 2009, eBay storage amounted to eight petabytes (think of 104 years of HD-TV video). Two years later, the Yahoo warehouse totalled 170 petabytes1 (8.5 times of all hard disk drives created in 1995)2. Since the rise of digitisation, enterprises from various verticals have amassed burgeoning amounts of digital data, capturing trillions of bytes of information about their customers, suppliers and operations. Data volume is also growing exponentially due to the explosion of machine-generated data (data records, web-log files, sensor data) and from growing human engagement within the social networks. The growth of data will never stop. According to the 2011 IDC Digital Universe Study, 130 exabytes of data were created and stored in 2005. The amount grew to 1,227 exabytes in 2010 and is projected to grow at 45.2% to 7,910 exabytes in 2015.3 The growth of data constitutes the “Big Data” phenomenon – a technological phenomenon brought about by the rapid rate of data growth and parallel advancements in technology that have given rise to an ecosystem of software and hardware products that are enabling users to analyse this data to produce new and more granular levels of insight. Figure 1: A decade of Digital Universe Growth: Storage in Exabytes Error! Reference source not found.3 1 ...
Words: 22222 - Pages: 89
...BIG DATA - THE MANAGEMENT REVOLUTION Summary The general theme of the article is to proof how data-driven decisions are better for businesses as data enables the managers to base their decision on evidence rather than intuition. The idea behind big data is to collect all kinds of data from various sources and to effectively utilize this data to improve the financial and operational aspects of the business. Companies that operate on digital platforms like Amazon are already experts at big data and are using the predictions based on data in a deft manner. The practice of big data should not be confined to companies that operate digitally but should be implied by other businesses as well, as big data is a revolutionary practice that provides data in larger volumes with higher velocity with which data is complied and entailed and the variety in which data is available. Furthermore, modifying a company to be data-driven is not only technologically challenging but poses a copious amount of managerial challenges as well. The decision is usually based on the senior manager who has to know how to answer questions and how to embrace evidence based decision. For a company to re-organize itself to become data-driven, the manager should concentrate on improvising five areas that include, leadership, talent management, technology, decision making and company culture. The author cities instances of big data using examples of airports where PASSUR Aerospace provided a service called...
Words: 3006 - Pages: 13
...Big Data Analytics (IB9CS) Mining, processing, analysing, and visualising large data sets. Week 6 Measuring happiness Suzy Moat Tobias Preis Suzy.Moat@wbs.ac.uk Tobias.Preis@wbs.ac.uk What we’ve covered Measuring Predicting What we’ve covered Measuring Economics Health Predicting What we’ve covered Measuring Predicting Economics Economics Health Crime What we’ve covered Measuring Predicting Economics Economics Health Crime Happiness Social networks http://www.ted.com/talks/nicholas_christakis_the_hidden_influence_of_social_networks.html Twitter and happiness Positive affect Negative affect Golder and Macy (2011, Science) Twitter and happiness Positive affect Negative affect Golder and Macy (2011, Science) Facebook and happiness Own updates % positive words % negative words Kramer et al. (2014, PNAS) Negativity reduced Positivity reduced News feed Facebook and happiness Own updates More positive % positive words % negative words More negative Kramer et al. (2014, PNAS) Negativity reduced Positivity reduced News feed Facebook and happiness Own updates % positive words % negative words Kramer et al. (2014, PNAS) Negativity reduced Positivity reduced News feed Facebook and happiness Own updates % positive words % negative words Kramer et al. (2014, PNAS) Negativity reduced Positivity ...
Words: 331 - Pages: 2
...White Paper Big Data Analytics Extract, Transform, and Load Big Data with Apache Hadoop* ABSTRACT Over the last few years, organizations across public and private sectors have made a strategic decision to turn big data into competitive advantage. The challenge of extracting value from big data is similar in many ways to the age-old problem of distilling business intelligence from transactional data. At the heart of this challenge is the process used to extract data from multiple sources, transform it to fit your analytical needs, and load it into a data warehouse for subsequent analysis, a process known as “Extract, Transform & Load” (ETL). The nature of big data requires that the infrastructure for this process can scale cost-effectively. Apache Hadoop* has emerged as the de facto standard for managing big data. This whitepaper examines some of the platform hardware and software considerations in using Hadoop for ETL. – e plan to publish other white papers that show how a platform based on Apache Hadoop can be extended to W support interactive queries and real-time predictive analytics. When complete, these white papers will be available at http://hadoop.intel.com. Abstract. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 The ETL Bottleneck in Big Data Analytics The ETL Bottleneck in Big Data Analytics. . . . . . . . . . . . . . . . . . . . . . 1 Big Data refers to the large amounts, at least terabytes, of poly-structured...
Words: 6174 - Pages: 25
...CLOUD TECHNOLOGIES TO APPLY IN COUNTRYPAL INTRODUCTION CLOUD/WEB PLATFORM Cloud computing offers business owners like countrypal an opportunity to take their businesses to the next level without having to invest heavily. The entire foundation of cloud computing is based on Iaas or Infrastructure as a Service, which extensively deals with providing the hardware in a cloud. This includes servers, routers, network equipment, firewall etc. However, all this hardware is of no use without a platform to help developers develop their applications, based on which the businesses can actually operate. Cloud computing platforms, which are otherwise known as Platform as a Service (PaaS), are therefore the middle level of cloud computing, which is extremely important for businesses to utilize the infrastructure and software that are available in the cloud. In order to make the most out of cloud computing platforms, irrespective of whether you are a business owner or a developer, it is important to understand the various aspects of this middle layer. Here is a brief overview of this aspect of cloud computing. When a business requires an application to be developed for its use, usually developers invest a lot of money into gathering the requisite infrastructure for its onsite development. But with cloud computing platforms, they can be delivered on the web. So, you can easily develop your applications without investing heavy amounts of money into buying software and other necessary tools for...
Words: 1611 - Pages: 7
...De-Identified Personal Health Care System Using Hadoop The use of medical Big Data is increasingly popular in health care services and clinical research. The biggest challenges in health care centers are the huge amount of data flows into the systems daily. Crunching this BigData and de-identifying it in a traditional data mining tools had problems. Therefore to provide solution to the de-identifying personal health information, Map Reduce application uses jar files which contain a combination of MR code and PIG queries. This application also uses advanced mechanism of using UDF (User Data File) which is used to protect the health care dataset. Responsibilities: Moved all personal health care data from database to HDFS for further processing. Developed the Sqoop scripts in order to make the interaction between Hive and MySQL Database Wrote MapReduce code for DE-Identifying data. Loaded the processed results into Hive tables. Generated test cases using MRunit. Best-Buy – Rehosting of Web Intelligence project The purpose of the project is to store terabytes of log information generated by the ecommerce website and extract meaning information out of it. The solution is based on the open source Big Data s/w Hadoop .The data will be stored in Hadoop file system and processed using PIG scripts. Which intern includes getting the raw html data from the websites, Process the html to obtain product and pricing information, Extract various reports out of the product pricing...
Words: 500 - Pages: 2
...The proposed changes in the norms of the visas will cause a lot of stress in the indian it scenario specially where they are highly dependent on their visas. The companies having higher workforce under the category of H1-B visa will suffe the same, in this category unfortunately, most of Indian it companies will fall. The amount of the fee these companies will have to pay would be much higher. Any employer which have more than 50 employees and have more than 30% but less than 50% will fall under category H1B or L1. The employer needs to pay $5000 for employee who is an extra for either of the standard. The employer which has more than 50% of the employees with H1B or L1 will have to pay $10000 fee. This is going to effect companies like TCS, WIPRO and Infosys. But will be useful in case of IBM, Dell etc which are US based. The proposed reform to the H1B visa standards were under the Senate’s comprehensive bill. It has the potential to cast a profound impact on the outstanding industry.the US government has accused Indian it companies of using the visas unfairly to send the employees from india at a lower cost, which impacts job creation in US. Most IT companies have said this would impact margin by 20 to 30 basis points and they would counter it by increasing hiring in US. Also it clears the opportunity of the local US players to get sub contracting from the bigger players. The impact on the revenues of the big companies such as Cognizant, TCS will give birth to theneed...
Words: 406 - Pages: 2
...1. What are Yunnan Lucky Air’s best options? Luck Air had a great business model, and that was to follow the same model as Southwest Airlines in the United States. Because Luck Air is considered a domestic airline in China they operate on a small scale compared to major competitors and so it made economical sense to offer low-cost, high-efficiency to their customers. In 2007 Lucky Air was able to more than double the amount of passengers from the year before by using a low-cost tactic. However other airlines have also caught on to offering low-cost fares for domestic routes to their passengers. With more competitors Lucky Air has decided to look at the possibility of taking a risk and to focus on e-commerce. Backed by their parent company Hainan Airlines, Luck Air has access to one of the most advanced web portals in the Chinese airline industry. But is e-commerce a good and viable option to compete with other airlines? Taking a look at what exactly e-commerce is important to grasp the understanding of how a business can operate, stay competitive and obtain a profit in a digital world. E-commerce involves digitally enabled transactions between and among organizations and individuals. Digitally enabled transactions include all those mediated by digital technology such as over the Internet, the Web and or via mobile apps. (Laudon & Traver, 2013, p. 55). Currently Airlines in China have high distribution costs which are fees paid to travel agents, the cost of staffing...
Words: 1737 - Pages: 7
...Industry size Supermarkets & Grocery Stores Market Research Report | NAICS 44511 | Jan 2015 Shopping smart: Increasing premium brand sales and healthy eating trends will spur growth The Supermarkets & Grocery Stores market research report provides key industry analysis and industry statistics, measures market size, analyzes current and future industry trends and shows market share for the industry’s largest companies. IBISWorld publishes the largest collection of industry reports so you can see an industry’s supply chain, economic drivers and key buyers and markets. Market Share of Companies Kroger Publix Super Markets Inc. Safeway Industry Statistics & Market Size Revenue $584bn | Annual Growth 10-15 1.3% | Annual Growth 15-20 | | Profit | Employment 2,489,995 | Businesses 42,036 | Industry Analysis & Industry Trends The Supermarkets and Grocery Stores industry has grown over the past five years, benefiting from a strengthening domestic economy. As per capita disposable income has grown over this period, some consumers traded up to premium, organic and all-natural brands, helping lift industry revenue. Over the next five years, the industry is anticipated to grow as a result of rising discretionary income, albeit at a more conservative rate than in the previous five-year period. As health concerns intensify, more consumers will seek all-natural and organic products, which are priced at a premium... purchase to read more Industry Report...
Words: 2318 - Pages: 10
...Volvo Car Corporation The Volvo Car Corporation is a multi-million dollar operation; therefore, among the business comes finances, customers, employees, and various challenges; in which, all has to be manageable. With such a large company and with all these things taken into aspect, you have to wonder how the company manages to keep it all together and remain successful. With this evolving innovative world of technology corporations such as Volvo are able to implement new programs to improve the companies’ infrastructure. Volvo Car Corporation has done just that. In response the companies has integrated the cloud infrastructure into its networks by compiling data via internet and operates globally. It is now known that through the Volvo Corporation “data is now being captured for use within the vehicle itself, and also, increasingly, for transmission via the cloud back to the manufacturer” (“converting data into," 2011). In a recent case study it was stated that “Volvo is deploying a pilot solution based on Microsoft SQL Server 2012 Business Intelligence data management software and related BI technologies, including Microsoft SharePoint Server 2010 and Microsoft Office 2010” ("Volvo car corporation," 2012). The new advancement will provide clarity and efficiency, whereas, images and reports will be provided. The Power View, which will be included, creates a visual image for data and answers unexpected questions. In addition, SharePoint will enable employees and customers...
Words: 561 - Pages: 3
...Seizure detection with Bigdata / Specific Problem, Gap Different technologies are available for neuroimaging e.g. magnetic resonance imaging (MRI), functional magnetic resonance imaging (FMRI) and electroencephalography (EEG) etc. The epileptic patients are normally monitored in the neurophysiological clinics using EEG, a non-invasive, multichannel technology for recording brain’s activity. Commonly used approach for epileptic seizure detection is the analysis of scalp EEG [3]. The technology used for scalp EEG is getting better rapidly. The scalp EEG used in clinics are capable of producing data at sampling rate of 2Khz. Furthermore in some studies; the number of channels used increased from tens to thousands [4]. To have an idea of the amount of data, a continuous EEG monitoring of a patient at 256 Hz with 24 channels can approximately generate 1GB data per day. With higher sampling rate and increased number of channels, EEG can produce far more data, e.g. 500GB per day [1]. All these characteristics make processing of EEG a compute intensive and data intensive task. Real time seizure detection...
Words: 840 - Pages: 4
...In this task, I will be explaining the role of promotion within the marketing mix for McDonald’s. Product- The product is at the heart of the whole marketing process. A business must have the right types of products that meet the needs of the market, live up to customer’s expectations and deliver its said benefits. A quality product is one that meets both the wants and needs of its consumers. Example: “The Big Tasty, served with bacon, features a 100% beef patty with square cut lettuce, onions, two tomato slices, Big Tasty sauce and three slices of cheese, made with Emmental, all in a sesame-topped bun.” http://www.mcdonalds.co.uk/ukhome/product_nutrition.beef.204.big-tasty-with-bacon.html Price- Price is the next thing that is important within a company’s “marketing mix.” A promotional activity should inform customers of the price of the product being promoted. Although price may be an important deciding factor of the product being promoted, it also carries implications for quality and value. Pricing has a lot to do with how a product is perceived. If a product is priced too expensive for its perceived benefits, it will most likely not sell in significant volumes. However, if a product is priced too low, then it can easily be considered as somewhat inferior to its competition. In order for the company to find out how they can best fit into the competitive market, they must first take a good look at their competitor’s products and positioning. Example: Products such as...
Words: 981 - Pages: 4
...basic need for the digital life. Transport – now-a-days the transport is also in the way of digital i.e., online reservation using different banking systems but the people who are living the rural areas do not know about such techniques. 8. Digital rural life in various countries RURAL URBAN Education Less than required level To required level Culture Diversified Diversified Equipment To required level More than required level Infrastructure To required level To required level Communication More than required level More than required level Transport As required As required Connectivity High High Banking To required level To required level Table3: urban and rural comparison 9. Effect of big data To provide the digital life in rural places, BIGDATA is the basic factor to store the information that was required to fulfill, as there will be lot of information about the people who are the residents of the rural places for the implementing of the big data application its is required to give the lot of the support to the people in the form the education, knowledge finance, man power , etc..., there are various several factors those which will come on the screen while the implementation is in process. As per the survey information even the big companies like IBM, Google, etc will be not able to store the information that will be collected from the rural places in the globe if they want to store the information they need to make their wings for than three times larger to achieve this goal...
Words: 1096 - Pages: 5