Sense & Scale

A site to explore cultures, cities, and computing at varying senses and scales. Updated by Ar Ducao, with content from classes at NYU, MIT, CUNY and more.

Contact: see syllabi

  • Announcements and Links New books and blogs Storefront for Art and Architecture show Visualized Conference Open Society / Knight Challenge Codepen and JSFiddle Networks in Equity and Sustainability Agenda 6:30-7:15: Discussion of NYC Open Data query assignment. 7:15-8:00: Catherine Cramer lecture + Q&A 8:00-8:10: Break 8:10-9:20: Visualization Design Activity: 8:10-8:15: Individual…

    Announcements and Links

    Agenda

    • 6:30-7:15: Discussion of NYC Open Data query assignment.
    • 7:15-8:00: Catherine Cramer lecture + Q&A
    • 8:00-8:10: Break
    • 8:10-9:20: Visualization Design Activity:
      • 8:10-8:15: Individual brainstorming (with post-its, etc).
      • 8:15-8:30: Idea forming in small groups. Diagramming on bigger sheets of paper.
      • 8:30-8:45: Literature search to validate and provide background
    • 8:45-9:20: Brief Presentations

    Notes

    Next Week’s Assignments

    • Blog Post 1, due Sept 24: Visualization of your NYC Open Data. Use Socrata’s example page, the D3 site, and Moment.js to turn your dataset into a simple visualization. You are welcome to use my bar chart tutorial to get started. Post a link to your vis on your blog. We will do a deep dive with these technologies next week.
    • Blog Post 2, due Oct 1: Ideation sketches/pictures from the NYSCI “big data for little kids” exercise, a short bibliography (using MLA citation format) of your scholarly resources, and reflections on the activity. We will revisit this in two weeks.
    +
  • Background   What is Socrata? Socrata is a Seattle-based company, originally founded as Blist in 2007, that has engineered the platform for a number of open government databases, including that of New York, Chicago, Baltimore, the White House, and a number of federal agencies. Socrata catalogs all its open datasets…

    Background

     

    What is Socrata?

    Socrata is a Seattle-based company, originally founded as Blist in 2007, that has engineered the platform for a number of open government databases, including that of New York, Chicago, Baltimore, the White House, and a number of federal agencies. Socrata catalogs all its open datasets here.
    It also hosts community hackathon sites here.
    SODA = Socrata Open Data API
    SoQL = Socrata Query Language

    Why are we learning Socrata tools?

    Socrata platforms are the gateway to a number of government datasets you may want to use. I also like that the platforms are well-structured, relatively well-documented, and offer an SQL-like query language that can be a good preview to using MySQL and other database packages.

    Do I always have to use SoQL to get Socrata data?

    Not at all! Sometimes a dataset is small enough that you can download it through the Socrata web UI. But in the case that you are dealing with massive datasets like the NYC 311 callbase, you will need to be able to use SoQL to get exactly what you want (unless you want to wait for hours and days to download massive datasets).

    Getting Started: NYC Open Data

    Please peruse the Socrata Open Data API Docs to understand the following NYC Open Data queries. I wanted to query this month’s 311 noise complaints in in my neighborhood (zip code 11231), so I obtained the dataset’s API endpoint, then I modified the format extension to output CSVs instead of JSON, and finally I added filter and query parameters to obtain this month’s noise complaints in 11231.

    Try pasting the following queries as URLs in your browser; for each query, a small a CSV will download to your computer.

    • First, try using a simple filter to get some 311 calls in 11231. The default number of records is 1000, and the default start date is in 2010.
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?incident_zip=11231
    • I want to start making more complex queries to obtain more recent records just from my neighborhood, so I changed the syntax to the SoQL format.
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?$where=incident_zip='11231'
    • Then I formed a query to increase the output to 10,000 records:
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?$where=incident_zip='11231'&$limit=10000
    • And a query for records created on or after September 1 2015:
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?$where=created_date >='2015-09-01T00:00:00'
    • And a query for noise complaints only:
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?$where=starts_with(complaint_type,'Noise')
    • Finally, I combined all the previous queries to get exactly what I wanted.
    https://data.cityofnewyork.us/resource/erm2-nwe9.csv?$where=starts_with(complaint_type,'Noise') AND created_date >='2015-08-01T00:00:00' AND incident_zip='11231'

     

    Now Try It!

    Expand the NYC Open Data exercise by looking at December 2014, the month of protests around Eric Garner’s death. Form a query that outputs a CSV with the following attributes:

    • data source: NYC Open Data
    • time period: December 2014
    • CSV size: smaller than 1 MB

    Start Visualizing

    If you have some visualization experience, you can use the Socrata examples to help you turn your new dataset into a visualization. This will be due in two weeks. If you aren’t there yet, give it a try. I’ll post a brief tutorial on visualizing your data next week.

    And a Handy Thing

    You will probably be using Excel, Google Sheets, Numbers, or another spreadsheet tool to view your data. It can be very handy–especially when you’re annotating your vis or writing a report on it–to know how to put together basic formulas using spreadsheet functions. Since we all have access to Google via our NYU addresses, here are some basic how-tos on putting together a formula using Google Sheet functions:

    + ,
  • Announcements and Links Groups Next week: NYSCI ideation, lit review exercise Punctuality and assignments Agenda 6:30-7:00- Book Club 7:00-8:00- Lecture: Data Vis Toolbox 8:00-8:10- Break 8:00-8:45- Review of class blogs 8:45-9:15: Discussion of spreadsheet functions and database queries 9:15-9:20- Next Week’s Assignments Notes The full Socrata/SODA/SoQL tutorial is now up! Please use it…

    Announcements and Links

    • Groups
    • Next week: NYSCI ideation, lit review exercise
    • Punctuality and assignments

    Agenda

    • 6:30-7:00- Book Club
    • 7:00-8:00- Lecture: Data Vis Toolbox
    • 8:00-8:10- Break
    • 8:00-8:45- Review of class blogs
    • 8:45-9:15: Discussion of spreadsheet functions and database queries
    • 9:15-9:20- Next Week’s Assignments

    Notes

    The full Socrata/SODA/SoQL tutorial is now up! Please use it to complete the assignments below.

    Next Two Week’s Assignments

    • Data Acquisition, required by September 17. Expand the NYC Open Data exercise by looking at December 2014, the month of protests around Eric Garner’s death. Post or send me a CSV with the following attributes:
      • data source: NYC Open Data
      • time period: December 2014
      • CSV size: smaller than 1 MB, if you’re e-mailing it
    • Blog Post, required by September 17. Write a short blog post that contains a link to your CSV if possible, discusses your process, and explains your observations. Also, discuss other data sources you’d like to explore in future classes. Also, post a visualization if you get that far!
    • Visualization, required by September 24. Use Socrata’s example page and the D3 site to start turning your dataset into a simple visualization. If you don’t get that far on the visualization part, that’s ok! We will do a deep dive with these technologies on September 24.
      • If you’re having trouble deciding what to visualize, just try making a simple bar chart of NYC data in December 2014.
    +
  • Now with final projects. In alphabetical order by first author. Good work everyone!   Company Acquisitions (1980-2015) by Hovsep Agop & Sandra Song A Visual Exploration of US Election Project Data by Helen Carey & Sriya Sarkar Major World Locations’ Live Temperature Anomalies by Neill Chua & Patrick Moraitis Beyond…

    +
  • Announcements While you’re waiting for class to begin, please take a look at the links as food for thought. Example Vis: Transit Visualization Client, Conversation Concept Map, MIT ML Pantheon Example Data Source: Socioeconomic Data and Applications Center (SEDAC), NYC Open Data Example application of data visualization: The Next America, The Search for High Energy…

    Announcements

    While you’re waiting for class to begin, please take a look at the links as food for thought.

    Agenda

    • 6:30-7:00- Syllabus Explanation/Discussion
    • 7:00-7:20- Introductions
    • 7:20-8:00- Lecture: A Brief History of Data Visualization
    • 8:00-8:40- Break & Data Vis Sculptures
    • 8:40-8:55- Reconvene
    • 8:55-9:15- Discussion
    • 9:15-9:20- Next Week’s Assignments

    Next Week’s Assignments

    • BOOKS: Go to the library, a bookstore, or an online retailer to get actual, printed book(s) for our DATA VIS BOOK CLUB next week. You can get a recommended book from the syllabus and/or get a book that inspires you. Add your book to this list so we don’t have too many copies of the same book.
    • BLOG: If you don’t have a blog already, set one up. Send me the blog URL. Add a brief blog post that
      • introduces yourself and why you’d like to take this class.
      • includes a link to a data visualization that inspires you. It could be a small visualization that you aspire to make through this class, or a large, long-term team effort. It could be a screen-based visualization or documentation from a tangible visualization (ie a haptic tool or a sculpture). It could be something that you’ve worked on. Briefly describe why it inspires you.

    Notes

    Photos from our Data Vis Sculptures. Captions and more notes to come.

    IMG_4623

    IMG_4627

    IMG_4625

     

    IMG_4630

    IMG_4638

    IMG_4633

     

    IMG_4639

    +
  • Data Visualization from 2D to 4D [Screen Graphics to Physical Objects] DM-GY 9103, Fall 2015 Prof. Arlene Ducao, arlduc [at] nyu.edu Thursdays, 6:30-9:20 PM 2 Metrotech, Room 811 Overview What is data visualization? Why and how do we do it? This course will take you through the process of understanding…

    Data Visualization from 2D to 4D
    [Screen Graphics to Physical Objects]

    DM-GY 9103, Fall 2015
    Prof. Arlene Ducao, arlduc [at] nyu.edu
    Thursdays, 6:30-9:20 PM
    2 Metrotech, Room 811


    Overview

    What is data visualization? Why and how do we do it? This course will take you through the process of understanding data visualization role’s in our information landscape, evaluating the kind of data that is best for visualization, and implementing the techniques used to create 2D, 3D, and 4D visualizations. Prerequisites: a basic understanding of HTML, CSS, and one scripting language, i.e. Javascript.

    Learning Goals

    • To understand the history, functionality, and anatomy of data visualization.
    • To classify data and information visualization based on temporal, spatial, tangible, and contextual criteria.
    • To choose and apply the appropriate tools for developing a wide array of basic data visualizations.
    • To plan and execute a complex data visualization project based on audience-centric design principles including significance, relevance, and usability.

    Class Format

    • First part (60-90 minutes): Lecture, discussion, critique.
    • Second part (90-110 minutes): Hands-on building & testing. Early sessions will offer technical how-tos and labs, later sessions will offer open work time for your projects.

    Schedule

    Note: Guest lecturers and critics are subject to change.

    • Class 1: September 3
      • Lecture: A Brief History of Data Visualization.
      • Activity: Introductions and Group Exercise.
    • Class 2: September 10
      • Lecture: Data Vis Toolbox.
      • Activity: Book sharing; Tabular and Query tools.
    • Class 3: September 17
      • Lecture: Catherine Cramer, New York Hall of Science.
      • Activity: Finish Tabular and Query tools; Lit Review exercise. 
    • Class 4: September 24
      • Lecture: Chris Willard, Guidewire Software.
      • Activity: Start brainstorming for midterm.
    • Class 5: October 1
      • Lecture: Catherine D’Ignazio, Emerson Engagement Lab.
      • Activity: Geo-spatial vis tools. 
    • Class 6: October 8
      • Lecture: Ekene Ijeoma, multi-dimensional cartographies.
      • Lecture: Kevin Miklasz, Function vs. aesthetics in data visualization: some case studies.
      • Activity: Student-student critiques and open work time.
    • Class 7: October 15
      • MIDTERM with guest critics De Angela Duff and Holly Orr, NYU.
    • Class 8: October 22
      • Lecture: Bex Hurwitz, Research Action Design. 
      • Activity: Human-centered design exercise (design principles).
    • No Class October 29.
    • Class 9: November 5
      • Lecture: Austin Lee, Microsoft and Carnegie Mellon.
      • Activity: CAD and Motion Graphics Tools. 
    • Class 10: November 12
      • Lecture: Rafi Santo, Mozilla Hive. 
      • Lesson: Kevin Miklasz on R and statistical significance. 
      • Activity: Additional tutorials and troubleshooting as needed.
    • Class 11: November 19
      • Lecture/lab: Richard The, Google and SVA.
      • Lecture/lab: Peiqi Su, NYU ITP.
    • Class 12: December 3.
      • Lecture: Sha Hwang, Healthcare.gov, Gifpop, more.
      • Activity: Prepare for FINAL
    • Class 13: December 10
      • FINAL with guest critics Amy Yu (Viacom) and Birago Jones (Indicator Ventures, UBQ)

    Recommended Tools

    • A tabular software environment (Excel, Google sheet, Zoho sheet, etc)
    • A relational software environment or interface (MySQL, Tableau, SODA)
    • Chrome (for javascript development)
    • Processing
    • A cartographic package (i.e. TileMill, CartoDB, QGIS)
    • Cytoscape or other network graph package (D3 could also work)
    • A free 3D CAD tool (I recommend trying OpenSCAD and TinkerCAD)
    • Quartz Composer (for Mac only)

    Recommended Books (to be discussed in Class 1)

    Foundational Books

    Recent Books

    Office Hours

    Thursday by appointment. E-mail arlduc [at] nyu.edu to make an appointment.

    Grading

    Note: Working in groups is strongly encouraged.

    • 25% Midterm: Demonstration of prototype & two-page paper with MLA-formatted bibliography.
    • 35% Final: Demonstration of prototype & four-page paper with MLA-formatted bibliography.
    • 20% Class participation.
    • 20% Blog posts based on class discussion and project development. At least ten posts are required from the entire semester (five by midterms, five by finals).
    • Encouraged extra credit options:
      • Expanded blogging
      • Video documentation
      • Project web site
      • Conference paper

    Attendance

    Attendance to all class sessions is mandatory. Class starts at 6:30 sharp. Excused absence requests, i.e. for a religious holiday or a conference, must be made at least 3 business days ahead of the scheduled absence. Emergency absences must be accompanied by official documentation, i.e. a doctor’s note or MTA notice. One letter grade drop will occur for every two unexcused late arrivals or one unexcused absence. For additional NYU School of Engineering Academic Policies and Requirements, please consult this link.

    Moses Statement

    If you are student with a disability who is requesting accommodations, please contact New York University’s Moses Center for Students with Disabilities at 212-998-4980 or mosescsd@nyu.edu.  You must be registered with CSD to receive accommodations.  Information about the Moses Center can be found at www.nyu.edu/csd. The Moses Center is located at 726 Broadway on the 2nd floor.
    +
  • It was a beautiful day to be stationed on the 9th floor of NYU’s Kimmel Center. With floor-to-ceiling windows overlooking Washington Square and Chelsea at our backs, each group of our QSAT class presented its final results to guest critics J.D. Godchaux of NiJeL.org and Alyssa Wright of Mapzen. A potluck snack table…

    It was a beautiful day to be stationed on the 9th floor of NYU’s Kimmel Center. With floor-to-ceiling windows overlooking Washington Square and Chelsea at our backs, each group of our QSAT class presented its final results to guest critics J.D. Godchaux of NiJeL.org and Alyssa Wright of Mapzen. A potluck snack table and an additional guest (Clàudia’s friend Nori) added a festive air to the excitement that often surrounds the last day of class.

    As I mentioned at the end of class, I am quite proud of each group’s leap forward in the last weeks and days of class. After midterms, in which most projects were established, and regular check-ins with each group, I thought I knew what to expect for the final presentations, but every group exceeded my expectations. Congratulations to all of this year’s QSAT students.

    In the first half of this semester, the focus was primarily on building technical skills, while in the second half, the focus was on contextualization (background literature review, user studies, persona design). Add to this a guest speaker to spark discussion in each class, and it’s a lot to pack into a semester. I think we had a good balance of topics and a great balance of students, but I think the challenge for me in the next iteration of this class is to think through some sequencing issues: perhaps ask students to build simpler, faster prototypes so that they can collect data over longer periods of time, and cover persona design and lit review earlier in the semester. This could make the start of the semester top-heavy, so it could make sense to have fewer speakers (though I feel that the speakers really help me as an instructor).

    But back to the present. Here are QSAT 2015’s final projects, in order of presentation on April 28. You can review the progression of each project, and my comments on project progress, by launching student blogs from the Class Blogs page.

    RoomSense

    Collective Pulse

    PulseMap

    RailSense (plus post on Persona Exercise)

    Spectral Scan (plus written paper)

    SQUID

    Above: Class photo from finals day. Top, left to right: Changyeon, Bartosz, Michael, Will, Matt, Dimas, Tong, Tanya, Kania, Graham. Bottom, left to right: Greg, Justin, Varun, me, Clàudia.

     

    + ,
  • Announcements Simband development kits Worst Pothole in Brooklyn Expectations for the final. See assignment below. Next week’s FINAL: Location: Kimmel Center, Room 903 Sign up for potluck here. TODAY’S AGENDA 12:10-12:15: Announcements 12:15-12:35: Colleen Kaman, IBM iX 12:35-12:55: Bex Hurwitz, RightsCon 12:55-1:15: Q&A 1:15-1:25: BREAK 1:25-3:00: Play/user/demo Sessions and work time.…

    Announcements

    TODAY’S AGENDA

    • 12:10-12:15: Announcements
    • 12:15-12:35: Colleen Kaman, IBM iX
    • 12:35-12:55: Bex Hurwitz, RightsCon
    • 12:55-1:15: Q&A
    • 1:15-1:25: BREAK
    • 1:25-3:00: Play/user/demo Sessions and work time. Be prepared to play test, user test, demo, or discuss with Bex or Colleen in the time slot below. When you’re not in a feedback session, please work on your project. You’re also welcome to grab classmates or me for additional feedback.
    1:25 – 1:50 PM 1:55 – 2:20 PM 2:30 – 2:55 PM
    Bex Hurwitz Collective Pulse Spectral Light RailSense
    Colleen Kaman SQUID RoomSense Pulse Map

     

    ASSIGNMENT FOR NEXT WEEK: Final Presentation!!!

    • PRESENTATION. Please prepare a 10-15 presentation for next week. I will publish a schedule in a few days.
    • DEMO: Weather permitting, we will have a low-key, outdoor demo in the last 30-60 minutes of class, so please bring the technology you have, and try to make it modular. I won’t evaluate you on your project “functionality” during demo time, but it will be great to see, celebrate, and discuss your project in action!
    • DELIVERABLES. Please include in a final post on all of your blogs, due May 1.
      • your slides
      • your write-up, which should integrate these previous assignments:
        • your literature review.
        • your data visualization(s).
        • a reflection on the user/persona design exercise and process (with screenshot of class drawing exercise).
        • please use MLA-style citations (use parenthetical references) and include bibliography in MLA format.
      • a simple electronics wiring diagram
      • optional additional materials (video, photos of your team collecting data, etc)
    • If you’re borrowing any materials from me, you must return them by May 1 in order to receive a grade.
    • EVALUATION CRITERIA [more info here]
      • significance and usability of concept
      • execution of concept (instrument deployment)
      • significance of data (data collection, visualization)
      • how the project fits into “Quantified Self”
      • how the project fits into “About Town”
      • explanation of next steps or future work
    + ,
  • [post in progress] TODAY’S AGENDA Announcements Next week’s class Final class: potluck? Game Research Methods Project updates. Be ready to show your data vis and talk about your plan for our last two weeks! Poem: “User“ Hands-On Session: Persona and User Stories. BREAK Persona Presentations. Project-development time. I will spend…

    [post in progress]

    TODAY’S AGENDA

    • Announcements
    • Project updates. Be ready to show your data vis and talk about your plan for our last two weeks!
    • Poem: “User
    • Hands-On Session: Persona and User Stories.
    • BREAK
    • Persona Presentations.
    • Project-development time. I will spend some time with each of your groups.

    ASSIGNMENT FOR NEXT WEEK

    • Briefly reflect on the persona(s) developed for your project. How did it change your perception of your own project? How will it help you going forward?
    • Prepare your project for 90 minutes of USER/PLAYTESTING next week. Demo and allow usage of the parts that don’t work; make paper or cardboard prototypes for the parts that aren’t yet implemented. Be ready to show the data you have so far, as well.
    + ,
  • TODAY’S AGENDA Announcements JHU: Preparing your data for analysis NYU Wagner seeking grad students to be Transportation RAs Lecture: Crowd and open data, by Sarah Kauffman, NYU Wagner. BREAK Literature Review Updates Further visualization options Standalone: Leaflet, D3, Kartograph, Polymaps Hosted accounts: MapBox/TileMill, CartoDB, etc MR samples for TileMill. WIRED…

    TODAY’S AGENDA

    ASSIGNMENT FOR NEXT WEEK:

    • Collect some data using the instruments you’ve previously developed.
    • Visualize the data using any package of your choice, and do an eyeball examination for patterns.
    • Be sure to post your visualization BEFORE NEXT CLASS! This way, you can briefly discuss your findings, and the next steps leading to the final project, with all of us.
    + ,