Your browser doesn't support the features required by impress.js, so you are presented with a simplified version of this presentation.

For the best experience please use the latest Chrome, Safari or Firefox browser.

CIS 115

Lecture 18: Big Data
Dr. William Hsu

Data Sources

Image Source: Data Science Central

Integrating Data - Wrappers

Image Source: Wikipedia

Integrating Data -
Extract, Transform, Load (ETL)

Image Source: Wikipedia

MapReduce

Image Source: Techspot

Big Data Uses

  • Topic Modeling
  • Natural Language Processing
  • Analytics & Data Forecasting
  • Sentiment Analysis & Crowdsourcing
  • Information Visualization
  • Thematic Mapping

Healthmap.org

Image Source: Healthmap.org

Image Source: Hendra Setiawan

Image Source: Hendra Setiawan

Image Source: Hendra Setiawan

MapReduce DNA

  • Count the number of occurences of each base in a DNA sequence
  • Need Mappers, Sorters, and Reducers

MapReduce DNA

Answers:

  • a - 1514
  • c - 1076
  • t - 1613
  • g - 837

Assignments

  • Read and be prepared to discuss:
    • Tubes Chapter 7: Where Data Sleeps
  • Blog 8: Artificial Intelligence Everywhere - Due 3/30 10:00 PM
  • HTML & CSS Project - Due 3/31 10:00 PM

Blog 8: Artificial Intelligence Everywhere

In today's world, it seems like artificial intelligence is everywhere. From the games we play, to the websites we use, to the systems governing our traffic and energy supply, artificial intelligence is everywhere. For this blog post, choose one example of artificial intelligence you interact with on a regular basis, and tell us about it. Some things you can include in your article:

  • Who created it (or the first example of such a system)?
  • How does it work? What algorithms or techniques are being used?
  • What makes this system useful? Does it have any negative factors?
  • How would things be different without this system?
  • Is this system useful? Necessary? Overkill? Dangerous? Frivolous?