Table of contents
  1. Meetups
    1. Data Science for Semantics, January 4, 2016, 6:30 p.m. TO BE RESCHEDULED
    2. Data Science for American Community Survey, February 1, 2016, 6:30 p.m.
    3. Kick-off Meetup: Tuesday, January 7, 2014, 6:30 p.m.
    4. Second Meetup: Tuesday, February 4, 2014, 6:30 p.m.
    5. Third Meetup: Tuesday, February 18, 2014, 6:30 p.m.
    6. Fourth Meetup: Tuesday, March 4, 2014, 6:30 p.m.
    7. Fifth Meetup: Tuesday March 18, 2014, 6:30 p.m.
    8. Sixth Meetup, Tuesday April 1, 2014, 6:30 p.m.
    9. Seventh Meetup: Tuesday, April 15, 2014, 6:30 p.m.
    10. Eighth Meetup: Tuesday, May 6, 2014, 6:30 p.m.
    11. Ninth Meetup: Tuesday, May 20, 2014, 6:30 p.m.
    12. Tenth Meetup: Monday, June 2, 2014, 6:30 p.m.
    13. Eleventh Meetup: Monday, June 30, 2014, 6:30 p.m.
    14. Twelfth Meetup: Monday, July 7, 2014, 6:30 p.m.
    15. Thirteenth Meetup: Monday, July 28, 2014, 6:30 p.m.
    16. Fourteenth Meetup August 4, 2014, 9:00 a.m.
    17. Fifteenth Meetup, Monday, August 11, 2014, 6:30 p.m.
    18. Sixteenth Meetup, Monday, September 8, 2014, 6:30 p.m.
    19. Seventeenth Meetup, Monday, September 22, 2014, 6:30 p.m.
    20. Eighteenth Meetup, Monday, October 6, 2014, 6:30 p.m.
    21. Nineteenth Meetup, Thursday, October 9, 2014, 7:30 a.m.
    22. Twentieth Meetup, Monday, November 3, 2014, 6:30 p.m.
    23. Twenty-first Meetup, Thursday, November 13, 2014, 4:30 p.m.
    24. Twenty-second Meetup, November 18-19, 2014, 8:30 a.m.
    25. Twenty-third Meetup, Monday, December 1, 2014, 6:30 p.m.
    26. Twenty-fourth Meetup, Monday, January 12, 2015, 6:30 p.m.
    27. Government Technology & Innovation Incubator for Big Data Analytics, January 27, 2015, 6:30 p.m.
    28. Twenty-fifth Meetup, Monday, February 2, 2015, 6:30 p.m.
    29. Data Science for MyFamilySearch.org and FamilyTree DNA, February 16, 2015, 6:30 p.m.
    30. Twenty-sixth Meetup, Monday, March 2, 2015, 6:30 p.m.
    31. USDA CIO and ACDO on Open Data Plan and Roundtable, March 16, 2015, 6:30 p.m.
    32. Government Technology & Innovation Incubator for Big Data Analytics II, March 25, 2015, 6:30 p.m.
    33. Twenty-seventh Meetup, Monday, April 6, 2015, 6:30 p.m.
    34. President's Chief Data Scientist and EPA Big Data Analytics Meetup, Monday, April 20, 2015, 6:30 p.m.
    35. 2015 Wharton DC innovation Summit, Federal Big Data Working Group Workshop, April 29, 2015, 1 p.m.
    36. Twenty-eighth Meetup, Monday, May 4, 2015, 6:30 p.m.
    37. USDA Data Science MOOC Meetup, Monday, May 18, 2015, 6:30 p.m.
    38. Data Science for NIST Big Data Framework Review by May 21, 2015
    39. Data Science for Health Datapalooza 2015, June 1-3, 2015
    40. Twenty-ninth Meetup, Monday, June 1, 2015, 6:30 p.m.
    41. Data Science for USGS Minerals Big Data Meetup, Monday, July 13, 2015, 6:30 p.m.
    42. Government Technology & Innovation Incubator for Big Data Analytics III, June 20, 2015, 10 a.m.
    43. Thirtieth Meetup, June 29, 2015, 6:30 p.m.
    44. Special Meetup at the NSF, July 15, 2015, 1 p.m.
    45. Thirty-first Meetup, July 20, 2015, 6:30 p.m.
    46. Data Science for Agency Initiatives 2015, August 3, 2015, 6:30 p.m.
    47. A NIH – Semantic Medline Data Science Data Publication Commons, August 17, 2015, 6:30 p.m.
    48. The Yosemite Project for Healthcare Information Interoperability August 31, 2015, 6:30 p.m.
    49. Big Data Science for Precision Farming Business September 14, 2015, 6:30 p.m.
    50. Climate Change & Genomic Data - Data Science Meetup of Meetups September 28, 2015, 6:30 p.m.
    51. Data Science for EPA & USGS Fracturing & Fracking Data, October 5, 2015, 6:30 p.m.
    52. Data Science for EPA's Chief Data Scientist: Big Data for Nutrients and Air Quality, October 19, 2015, 6:30 p.m.
    53. Data Science for Random Forests: TIBCO Enterprise Runtime for R, November 2, 2015, 6:30 p.m.
    54. Special Data Science Meetup of Data Science Meetups, November 5-6, 2015
      1. Day 1 November 5, 2015
      2. Day 2 November 6, 2015
    55. Data Science for the DataAct Datathon, To Be Rescheduled
    56. 3rd Annual Big Data for Intelligence Symposium, Nov. 17-18, 2015
    57. Data Science and Semantic Insights for DoD Joint Doctrine, December 7, 2015, 6:30 p.m.
    58. Data Science for Semantics: MarkLogic and Cray Graph Appliance Update, January 4, 2016, 6:30 p.m. TO BE RESCHEDULED
    59. Data Science for American Community Survey, February 1, 2016, 6:30 p.m.
  2. Possible Team Presentations
  3. Subcommittee on Networking and Information Technology Research and Development
  4. Co-organizers
  5. Xcelerate Solutions Office Looking South to the Spring Hill Road Silver Line Metro Station
  6. Xcelerate Solutions Conference Room
  7. Meetup Help
  8. Join.me Help

Federal Big Data Working Group Meetup


Meetups

I would like to organize frequent, results-focused meetups that give a broad community of participants the opportunity to work on big data products for the Federal Big Data Initiative, like those I have been involved in myself:

Figure: Poster entitled "MOOC, every letter is negotiable," exploring the meaning of the words "Massive Open Online Course". Source: Wikipedia

Founder: Brand Niemann. Co-organizer: Ali Quershi (originally Katherine Goodier). Host: Xcelerate Solutions, with offices in Tysons Corner, VA (8405 Greensboro Dr., Suite 930, McLean, VA 22102; capacity about 50, with Skype and Wi-Fi available). The Silver Line Spring Hill Metro stop (planned to open in July 2014) is across the street (Route 7 and Spring Hill Road). Directions to the building are easy, and there is open underground parking. See the photo below, taken from the Xcelerate Solutions office looking south to the Spring Hill Road Silver Line Metro Station.

For individuals coming to the Meetup from the Metro Orange Line, public transportation is available from the West Falls Church Metro via Fairfax Connector Bus 427, which is accessed from the north side of the West Falls Church Metro.
Bus 427 schedule and map: http://www.fairfaxcounty.gov/connector/pdf/427.pdf
Bus 427 travels down Greensboro Drive, and Xcelerate Solutions is the last stop on Greensboro Drive. The office is directly across the street from the stop. The bus leaving West Falls Church at 5:50 p.m. arrives at the Greensboro Drive stop at approximately 6:11 p.m. (an earlier bus leaves at 5:35 p.m. and arrives at 5:50 p.m.).

Note: We have suggestions of other spaces, so let's start here and see what the community wants.

Data Science for Semantics, January 4, 2016, 6:30 p.m. TO BE RESCHEDULED

Data Science for American Community Survey, February 1, 2016, 6:30 p.m.

Data Science for American Community Survey

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for American Community Survey
  • 7:15 p.m. Brief Member Introductions
  • 7:30 p.m. Invited Presentation: Introduction to the American Community Survey. Description: Discover the detailed social, economic, and housing statistics that the American Community Survey (ACS) provides for every community every year. You will learn about the basics of the yearly estimates and datasets produced from the ACS, resources available on our website (http://census.gov/acs), and how to access ACS data products through a variety of tools, with emphasis on American FactFinder. Date: August 25, 2015; Time: 2pm-3pm ET; Level: Introductory
  • 8:15 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Kick-off Meetup: Tuesday, January 7, 2014, 6:30 p.m.

Let's have our Kickoff in early January 2014

Slides

Suggested Format:

  • 6:30 p.m. Tutorials and Refreshments (I will start with Practical Data Science for Data Scientists - Proposed GMU Course, and hope that others will offer to do tutorials as well): What is Data Science and the Data Science Process?
  • Proposed: Joint Meetup with OSTP FASTER CoP
  • 7:00 p.m. Introductions and Announcements (10 seconds per individual depending on the size of the group)
  • 7:15 p.m. Featured Presentation/Demonstration (where did you get the data, where did you store the data, and what were your results)
  • 8:30 p.m. Networking/Individual Demos (talk among yourselves and look at one another's work)
  • 9:00 p.m. Continue Your Conversations Elsewhere (We need to clear out of the space)

Getting Started:

Second Meetup: Tuesday, February 4, 2014, 6:30 p.m.

Slides (Tutorial), Slides (Inside Analysis), Slides (Healthcare.gov Data Science), and Be Informed (Video)

Announcements:

Big Data Sets Available for Free: http://www.datasciencecentral.com/pr...lable-for-free?

Strata Conference, February 11-13 (see free online available): http://strataconf.com/strata2014/pub...le/full/public

NIH Big Data-Data Science Funding Opportunities: http://bd2k.nih.gov/#sthash.QLGQEwh0.jncy9RZN.dpbs

JHU Data Science Course: https://www.coursera.org/specializat...udatascience/1

Third Meetup: Tuesday, February 18, 2014, 6:30 p.m.

Evolution of Semantic Technologies-The Value of Merging Smart Data With Big Data

6:30 pm

Slides

Continue Data Science Tutorial: Practical Data Science for Data Scientists: Asking and Answering Questions About Data

Data Science for Business: Book Review Tutorial; Data Science for Business: EPA Waterways; Thinking with Data: Book Review Tutorial; and Data Science for Business: Semantic Verses and Semantic Verses-Klangoo

7 pm

Welcoming Remarks by Dr. George Strawn, Director, NITRD/NCO and co-chair of the Federal Big Data Senior Steering Work Group Slides 
7:05 pm

Brief Introductions (10 Seconds - Name, Organization and Interests)

7:15 pm

Evolution of Semantic Technologies-The Value of Merging Smart Data With Big Data: Eric Little, Modus Operandi and Dr. Silva. See Modus Operandi Semantic Knowledge Base

This talk will briefly highlight the history of semantic technologies (and ontology development) and will point out some of the successes and failures of the discipline over the past couple of decades. We will then discuss how semantics is becoming more relevant now that orthogonally related technologies and techniques in Big Data Science have matured, in that they provide a computational framework that was previously lacking. The ability to scale data in cloud databases (e.g., NoSQL data stores) is providing incredible heuristics for managing and manipulating large amounts of data; however, providing advanced analytics within those systems is a growing challenge due to their lack of relational structure. Semantic technologies can provide the necessary organizational structure for advanced querying and reasoning over large graphs at levels of scale that can deliver real business intelligence and advanced analytics.

A White Paper in progress on "Making Big Data Small" Using Semantics & Advanced Analytics for the NITRD will also be discussed.

Dr. Eric Little, PhD Vice President – Chief Scientist, Modus Operandi Slides

Jeff Lessner, Vice President - Development, Modus Operandi

Dr. John Silva

8:30 pm

Networking/Individual Demos (talk among yourselves and look at one another's work)

9:00 pm

Continue Your Conversations Elsewhere (We need to clear out of the space) 

Fourth Meetup: Tuesday, March 4, 2014, 6:30 p.m.

Joint NSF-NIH Biomedical Big Data Research

NIST Data Science Symposium Agenda (Revised for Two Hour Delay) and Proceedings

Poster Abstract for NIST Data Science Symposium

Joint NSF-NIH Biomedical Big Data Research: Euretos BRAIN

Logistics

  • Hosted at The National Science Foundation, 4201 Wilson Boulevard, Arlington, Virginia 22230.
  • A short distance from the Ballston Metro Orange Line. Free Street Parking and Paid Ballston Common Mall Parking.
  • The building's main entrance is located at the corner of 9th and Stuart Streets.
  • No prior clearance required for Conference Room 1-110.
  • Dinner will not be served for this Meetup. NO FOOD ALLOWED IN THE ROOM. Please Eat Before at a Nearby Place Like the Ballston Mall Food Court.
  • Start time is 6:30 p.m.
  • Restrooms are just outside.

Agenda

Meetup Slides

Note: Brief demo of Watson/IBM by Frank Stein and Chris Welty (Slides) at a future event.

Jointly designing a DATA FAIRPORT – Conference Report Final Version

[Images: DATA FAIRPORT final report cover page and Figure 1]

Executive summary

This document recapitulates the discussions which took place during the first FAIRPORT unconference on the 13-16th of January 2014. The meeting aimed at defining the DATA FAIRPORT, a minimal (yet comprehensive) framework in which current issues in data discoverability, access, annotation and authoring can be addressed.

The DATA FAIRPORT will not dictate a single platform or a tightly integrated data infrastructure. Rather, it will focus on conventions that enable data interoperability, stewardship and compliance against data and metadata standards, policies and practices. It was proposed that the convention for data and model services interoperability should be based on the minimal “hourglass” approach, which is the same as the approach that underpins the internet, the web and other robust, heterogeneous yet interoperable infrastructures. The hourglass focuses on the specification of lightweight interfaces, standard protocols and standard formats to define a ‘minimal Data Fairport scope’. It was proposed that the conventions for data and model metadata descriptions be founded on community standards for: identifiers, formats, checklists and vocabularies.

The DATA FAIRPORT is not about the development of yet more standards; it is about

  • the adoption of standards
  • the communication of standards
  • the simplification of standard interoperation
  • the adoption of cross-cutting standards for provenance, versioning, identity and dependency for data and for metadata
  • the interoperation of data services (is that another workstream?)
  • the reconciliation of evolving standards and the datasets organised or annotated by them
  • the minimal models of investigation for grouping results
  • the metadata required to link data with analytics (notably models)
  • data citation: mechanics, adoption, recognition

Although not part of the core DATA FAIRPORT convention, the need for minimally one reference implementation has been recognized as essential to achieve a wide adoption, and a functional demonstrator should emerge in the coming months from voluntary contributions of the attendees.

Fifth Meetup: Tuesday March 18, 2014, 6:30 p.m.

Continue Data Science Tutorial and Learn About Bigdata SYSTAP

  • Slides
  • Continue Data Science Tutorial: Practical Data Science for Data Scientists: Specific Data Science Tools and Applications 1; see also Graph Databases and Bigdata SYSTAP; Literature Survey of Graph Databases; and A Data Science Big Mechanism for DARPA. Also see NSF Funding Opportunities in Data Science
  • Bigdata SYSTAP: Michael Personick and Bryan Thompson, SYSTAP. Bryan Thompson, Chief Scientist of SYSTAP, LLC, will speak about their open source graph database platform. Highlights will include support for highly available replication clusters as well as their recent work with accelerated graph processing on GPUs at 3 billion traversed edges per second. Slides
  • Abstract: bigdata(R) is a high performance, scalable, open source graph database platform supporting the RDF data model and edge attributes. I will provide a brief overview of the bigdata platform; summarize some of its key differentiators, including the High-Availability enterprise deployment model, an API for writing graph mining algorithms against RDF data, and efficient representation and query of edge attributes; discuss approaches for combining bigdata clusters with map/reduce processing; and provide a glimpse of new features in our roadmap, including accelerated graph processing on GPUs at 3 billion edges per second.
  • SYSTAP, LLC leads the development of the bigdata open source platform and offers consulting services related to scalable information architectures, as well as services and support for the bigdata platform. Bigdata is available under both open source and commercial licenses.

http://www.systap.com
http://www.bigdata.com/blog
http://sourceforge.net/projects/bigdata/

  • Biography: Bryan Thompson (SYSTAP, LLC) is the Chief Scientist and co-Founder of SYSTAP, LLC. He is the lead architect for bigdata®, an open source graph database used by Fortune 500 companies including EMC (SYSTAP provides the graph engine for the topology server used in their host and storage management solutions) and Autodesk (SYSTAP provides their cloud solution for graph search). He is the principal investigator for a DARPA research team investigating GPU-accelerated distributed architectures for graph databases and graph mining. He has over 30 years of experience related to cloud computing; graph databases; the semantic web; web architecture; relational, object, and RDF database architectures; knowledge management and collaboration; artificial intelligence and connectionist models; natural language processing; metrics, scalability studies, benchmarks and performance tuning; and decision support systems.
  • Poster 08: A High Level API for Fast Development of High Performance Graphic Analytics on GPUs

    Zhisong Fu, SYSTAP LLC., United States

    High performance graph analytics are critical for a long list of application domains, ranging from social networks, information systems, security, biology, healthcare and life sciences. In recent years, the rapid advancement of many-core processors, in particular graphical processing units (GPUs), has sparked a broad interest in developing high performance graph analytics on these architectures. However, the single instruction multiple thread (SIMT) architecture used in GPUs places particular constraints on both the design and implementation of graph analytics algorithms and data structures, making the development of such programs difficult and time-consuming.

    We present an open source library (MPGraph) that provides a high level abstraction which makes it easy to develop high performance graph analytics on massively parallel hardware. This abstraction is based on the Gather-Apply-Scatter (GAS) model as used in GraphLab. To deliver high performance computation and efficiently utilize the high memory bandwidth of GPUs, the underlying CUDA kernels use multiple sophisticated strategies, such as vertex-degree-dependent dynamic parallelism granularity and frontier compaction. Our experiments show that, for many graph analytics algorithms, an implementation with our abstraction is up to two orders of magnitude faster than parallel CPU implementations on up to 24 CPU cores and has performance comparable to a state-of-the-art manually optimized GPU implementation. In addition, with our abstraction, new algorithms can be implemented in a few hours that fully exploit the data-level parallelism of the GPU and offer throughput of up to 3 billion traversed edges per second on a single GPU. We will explain the concepts behind the high-level abstraction and provide a starting point for people who want to write high throughput analytics.

    MPGraph is now in its second release.  Future work will extend the platform to multi-GPU workstations and GPU compute clusters.

  • DARPA Open Catalog Source: http://www.darpa.mil/opencatalog/

    Catalog columns: XDATA Team, Software, Category, Instructional Material, Code, Dev Stats, Description, License

    XDATA Team: SYSTAP, LLC
    Software: bigdata
    Category: Infrastructure (2014-07)
    Code: URL
    Dev Stats: stats
    Description: Bigdata enables massively parallel graph processing on GPUs and many core CPUs. The approach is based on the decomposition of a graph algorithm as a vertex program. The initial implementation supports an API based on the GraphLab 2.1 Gather Apply Scatter (GAS) API. Execution is available on GPUs, Intel Xeon Phi (aka MIC), and multi-core CPUs.
    License: GPLv2

    XDATA Team: SYSTAP, LLC
    Software: mpgraph
    Category: Analytics (2014-07)
    Code: URL
    Dev Stats: stats
    Description: Mpgraph enables massively parallel graph processing on GPUs and many core CPUs. The approach is based on the decomposition of a graph algorithm as a vertex program. The initial implementation supports an API based on the GraphLab 2.1 Gather Apply Scatter (GAS) API. Execution is available on GPUs, Intel Xeon Phi (aka MIC), and multi-core CPUs.
    License: ALv2
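
The Gather-Apply-Scatter (GAS) vertex-program model named in the poster abstract and catalog entries above can be illustrated with a small sequential sketch. This is not MPGraph or bigdata code and it does not run on a GPU; the function names (run_gas, sssp) and the example graph are hypothetical, chosen only to show how an algorithm decomposes into gather, apply, and scatter steps driven by a frontier of active vertices. Here the vertex program is single-source shortest paths.

```python
from collections import defaultdict
from math import inf


def run_gas(edges, init, gather, combine, apply, frontier):
    """Sequential, synchronous Gather-Apply-Scatter loop (illustrative only).

    edges    -- iterable of (source, target, weight) triples
    init     -- dict mapping every vertex to its initial state
    frontier -- vertices that are active in the first round
    """
    in_nbrs = defaultdict(list)   # v -> [(u, weight)] for edges u -> v
    out_nbrs = defaultdict(list)  # u -> [v] for edges u -> v
    for u, v, w in edges:
        in_nbrs[v].append((u, w))
        out_nbrs[u].append(v)

    state = dict(init)
    frontier = set(frontier)
    while frontier:
        updates = {}
        for v in frontier:
            # Gather: combine one message per incoming edge.
            acc = None
            for u, w in in_nbrs[v]:
                msg = gather(state[u], w)
                acc = msg if acc is None else combine(acc, msg)
            # Apply: compute the new vertex state from the gathered value.
            new = apply(state[v], acc)
            if new != state[v]:
                updates[v] = new
        state.update(updates)
        # Scatter: only out-neighbours of changed vertices stay active.
        frontier = {t for v in updates for t in out_nbrs[v]}
    return state


def sssp(edges, source):
    """Single-source shortest paths expressed as a GAS vertex program."""
    edges = list(edges)
    vertices = {x for u, v, _ in edges for x in (u, v)}
    init = {v: inf for v in vertices}
    init[source] = 0
    return run_gas(
        edges,
        init,
        gather=lambda dist_u, w: dist_u + w,   # candidate distance via u
        combine=min,                           # keep the best candidate
        apply=lambda old, acc: old if acc is None else min(old, acc),
        frontier=[v for u, v, _ in edges if u == source],
    )


EDGES = [("a", "b", 1), ("b", "c", 2), ("a", "c", 5), ("c", "d", 1)]
print(sorted(sssp(EDGES, "a").items()))
# [('a', 0), ('b', 1), ('c', 3), ('d', 4)]
```

In this sketch the frontier shrinks to the vertices whose inputs changed in the previous round, which is the idea behind keeping a compact active set. A GPU implementation would run the gather and apply steps for all frontier vertices in parallel rather than in a Python loop.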

Sixth Meetup, Tuesday April 1, 2014, 6:30 p.m.

Marc Smith, Network Analytics, and Katherine Goodier on Big Data Privacy

  • 6:30 p.m. Welcome and Introduction Slides
  • 6:35 p.m. Continue Data Science Tutorial: Marc Smith, Connected Action, Network Analytics and Visualization of Big Data Privacy Workshop Tweets See Title and Abstract Below SlideShare
  • 7:00 p.m. Brief Member Introductions and Marc Smith Continues
  • 7:30 p.m. Dr. Katherine Goodier, Technical Architect, Xcelerate Solutions, Slides, Discussing Remarks by the President on Review of Signals Intelligence, excerpts:
    • But the government collection and storage of such bulk data also creates a potential for abuse.
    • Yet there is an inevitable bias not only within the intelligence community, but among all of us who are responsible for national security, to collect more information about the world, not less.
    • I have also asked my counselor, John Podesta, to lead a comprehensive review of big data and privacy.  And this group will consist of government officials who, along with the President’s Council of Advisors on Science and Technology, will reach out to privacy experts, technologists and business leaders, and look how the challenges inherent in big data are being confronted by both the public and private sectors; whether we can forge international norms on how to manage this data; and how we can continue to promote the free flow of information in ways that are consistent with both privacy and security.
    • See Recent Big Data Privacy Workshop, Co-hosted by The White House Office of Science & Technology Policy and MIT, March 3, 2014 | Cambridge, Massachusetts and The Use and Abuse of Big Data and Hadoop
    • See Information Sharing Environment and State Health Databases
  • 8:15 p.m. Open Discussion
  • 8:30 p.m. Networking
  • 9:00 p.m. Depart

END OF AGENDA

Title: Charting Collections of Connections in Social Media: Creating Maps and Measures with NodeXL

Abstract:  Networks are a data structure commonly found across all social media services that allow populations to author collections of connections.  The Social Media Research Foundation's NodeXL project makes analysis of social media networks accessible to most users of the Excel spreadsheet application.  With NodeXL, Networks become as easy to create as pie charts.  Applying the tool to a range of social media networks has already revealed the variations present in online social spaces.  A review of the tool and images of Twitter, flickr, YouTube, and email networks will be presented.

Description: We now live in a sea of tweets, posts, blogs, and updates coming from a significant fraction of the people in the connected world.  Our personal and professional relationships are now made up as much of texts, emails, phone calls, photos, videos, documents, slides, and game play as by face-to-face interactions.  Social media can be a bewildering stream of comments, a daunting fire hose of content.  With better tools and a few key concepts from the social sciences, the social media swarm of favorites, comments, tags, likes, ratings, and links can be brought into clearer focus to reveal key people, topics and sub-communities.  As more social interactions move through machine-readable data sets new insights and illustrations of human relationships and organizations become possible.  But new forms of data require new tools to collect, analyze, and communicate insights.

A new organization, the Social Media Research Foundation (http://www.smrfoundation.org), has been formed to develop open tools and open data sets, and to foster open scholarship related to social media.  The Foundation's current focus is on creating and publishing tools that enable social media network analysis and visualization from widely used services like email, Twitter, Facebook, flickr, YouTube and the WWW. The Foundation has released the free and open NodeXL project (http://www.codeplex.com/nodexl), a spreadsheet add-in that supports "network overview discovery and exploration".  The tool fits inside your existing copy of Excel in Office 2007 or 2010 and makes creating a social network map similar to the process for making a pie chart.

Using NodeXL, users can easily make a map of public social media conversations around topics that matter to them. Maps of the connections among the people who recently said the name of a product, brand or event can reveal key positions and clusters in the crowd.  Some people who talk about a topic are closer to the "center" of the graph; they may be key influential members of the population.  NodeXL makes it a simple task to sort people in a population by their network location to find key people in core or bridge positions.  NodeXL supports the exploration of social media with import features that pull data from personal email indexes on the desktop, Twitter, Flickr, YouTube, Facebook and WWW hyper-links.  The tool allows non-programmers to quickly generate useful network statistics and metrics and create visualizations of network graphs.
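
The idea of sorting people by network location can be illustrated outside Excel with a minimal sketch. This is not NodeXL's implementation; it assumes a small, made-up, undirected "who mentioned whom" edge list and uses two common measures: degree (number of direct connections) and betweenness centrality (how often a person lies on the shortest paths between others, which highlights bridge positions). The names and the helper functions build_graph, degree, betweenness and rank_by are hypothetical.

```python
from collections import defaultdict, deque


def build_graph(edges):
    """Build an undirected adjacency map from (person, person) pairs."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def degree(graph):
    """Number of direct connections per person."""
    return {node: len(neighbors) for node, neighbors in graph.items()}


def betweenness(graph):
    """Betweenness centrality (Brandes' algorithm, unweighted, undirected)."""
    score = {node: 0.0 for node in graph}
    for source in graph:
        stack = []                              # nodes in order of discovery
        preds = {node: [] for node in graph}    # predecessors on shortest paths
        paths = {node: 0 for node in graph}     # number of shortest paths
        dist = {node: -1 for node in graph}
        paths[source] = 1
        dist[source] = 0
        queue = deque([source])
        while queue:                            # breadth-first search
            v = queue.popleft()
            stack.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    paths[w] += paths[v]
                    preds[w].append(v)
        delta = {node: 0.0 for node in graph}
        while stack:                            # accumulate dependencies
            w = stack.pop()
            for v in preds[w]:
                delta[v] += paths[v] / paths[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # Each pair is counted once from each endpoint in an undirected graph.
    return {node: value / 2 for node, value in score.items()}


def rank_by(metric):
    """Sort people from highest to lowest score, ties broken by name."""
    return sorted(metric, key=lambda node: (-metric[node], node))


# Hypothetical data: two tight clusters joined by a single cat-dan link.
edges = [
    ("ann", "bob"), ("ann", "cat"), ("bob", "cat"),
    ("cat", "dan"),
    ("dan", "eve"), ("dan", "fay"), ("eve", "fay"),
]
graph = build_graph(edges)
print(rank_by(betweenness(graph)))
```

In this toy graph "cat" and "dan" come out on top because every shortest path between the two clusters runs through them, which is the kind of bridge position the paragraph above describes.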

A book, Analyzing Social Media Networks with NodeXL: Insights from a connected world, is available from Morgan-Kaufmann.  The book provides an introduction to the history and core concepts of social network analysis along with a series of step-by-step instructions that illustrate the use of the key features of NodeXL.  The second half of the book is dedicated to chapters by a number of leading social media researchers that each focus on a single social media service and the networks it contains. Chapters on Twitter, email, YouTube, flickr, Facebook, Wikis, and the World Wide Web illustrate the network data structures that are common to all social media services.

Recent research featuring data analyzed with NodeXL was released by the Pew Internet Research Center.  The report documents the six distinct patterns of social media network structures found to occur in Twitter and related forms of social media platforms.

Bio 

Dr. Marc A. Smith

Chief Social Scientist

Connected Action Consulting Group

Marc@connectedaction.net  
http://www.connectedaction.net

http://nodexl.codeplex.com

http://twitter.com/marc_smith
http://www.smrfoundation.org/

Marc Smith is a sociologist specializing in the social organization of online communities and computer mediated interaction. Smith leads the Connected Action consulting group and lives and works in Silicon Valley, California.  Smith co-founded the Social Media Research Foundation (http://www.smrfoundation.org/), a non-profit devoted to open tools, data, and scholarship related to social media research.

Smith is the co-editor with Peter Kollock of Communities in Cyberspace (Routledge), a collection of essays exploring the ways identity, interaction and social order develop in online groups. Along with Derek Hansen and Ben Shneiderman, he is the co-author and editor of Analyzing Social Media Networks with NodeXL: Insights from a connected world, from Morgan-Kaufmann, a guide to mapping connections created through computer-mediated interactions.

Smith's research focuses on computer-mediated collective action: the ways group dynamics change when they take place in and through social cyberspaces. Many "groups" in cyberspace produce public goods and organize themselves in the form of a commons (for related papers see: http://www.connectedaction.net/marc-smith/). Smith's goal is to visualize these social cyberspaces, mapping and measuring their structure, dynamics and life cycles. While at Microsoft Research, he founded the Community Technologies Group and led the development of the "Netscan" web application and data mining engine that allowed researchers studying Usenet newsgroups and related repositories of threaded conversations to get reports on the rates of posting, posters, crossposting, thread length and frequency distributions of activity.  He contributes to the open and free NodeXL project (http://www.codeplex.com/nodexl) that adds social network analysis features to the familiar Excel spreadsheet.  NodeXL enables social network analysis of email, Twitter, Flickr, WWW, Facebook and other network data sets.

The Connected Action consulting group (http://www.connectedaction.net) applies social science methods in general and social network analysis techniques in particular to enterprise and internet social media usage.  SNA analysis of data from message boards, blogs, wikis, friend networks, and shared file systems can reveal insights into organizations and processes.  Community managers can gain actionable insights into the volumes of community content created in their social media repositories.  Mobile social software applications can visualize patterns of association that are otherwise invisible.

Smith received a B.S. in International Area Studies from Drexel University in Philadelphia in 1988, an M.Phil. in social theory from Cambridge University in 1990, and a Ph.D. in Sociology from UCLA in 2001. He is an adjunct lecturer at the College of Information Studies at the University of Maryland.  Smith is also a Distinguished Visiting Scholar at the Media-X Program at Stanford University.

Seventh Meetup: Tuesday, April 15, 2014, 6:30 p.m.

Katherine Goodier, Cognitive Metadata, and Cambridge Semantics, Insider Trading

  • 6:30 p.m. Brand Niemann, Introduction and Xcelerate Solutions Refreshments
    • Data Papers in Data Browsers Tutorial: On your Own (where did you get the data, where did you store the data, and what were your results?)
    • Data Science for the Financial Industry and Three Approaches to Semantic Normalization and Interoperability (see background research below)
  • 6:35 p.m. Dr. Katherine Goodier, Technical Architect, Excelerate Solutions, Cognitive Metadata: The Killer Enabler for Federal Big Data Security and Privacy in the Clouds: It’s all about the metadata Slides
  • 7:00 p.m. Brief Member Introductions
  • 7:10 p.m. Dr. Katherine Goodier Continues
  • 7:30 p.m. Marty Loughlin, VP, Financial Services, Cambridge Semantics (Remote), Examples of customer use cases in financial services in areas like compliance, data onboarding, and insider trading surveillance. Slides
  • 8:30 p.m. Open Discussion
    • Randy Coleman's comment to Cambridge Semantics “that they couldn’t be using FIBO yet”, or something like that. Response: "That's right, the only formal FIBO that was out at the time of the demo was FIBO-FND and BE (Foundation and Business entities).  In order for Cambridge to use FIBO in the context of the demo, they need several ontologies in the Common Objects Classification that is not out yet.  Some of Common Objects is being released for RFC as we speak."
    • I also recall at the Chasm Seminar someone saying that FIBO has developed only 2 of the 27 ontologies needed for more complete implementation. Is someone developing those additional FIBO ontologies? Response: "That was Dennis Wisnosky, we are working on an approach to accelerate the release of additional ontologies through a process called "FIBO-in Waiting".  User communities could build FIBO Content Teams (FCTs) in partnership with EDMC and work with advance release FIBO Ontologies and feed back comments and improvements to EDMC/OMG through the FCTs.  This would create an iterative process to mature all of the FIBO ontologies as quickly as possible and get them into RFC for final acceptance."
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Background:

  • Andrea Westerinen (invited-accepted-then not-changed to Cambridge Semantics): Reusing the FIBO Ontologies for Financial Institutions. Her Note: My use of FIBO was not related to big data at all. It was purely for mining relevant concepts for another ontology. So, I don't have any relevant information to discuss, although I am interested in the big data problem, especially related to using ontologies.
  • Kees Manson (invited-change employment-requested replacement): Be Informed Metamodels for Financial Services
  • Capital One (invited-waiting to hear back): Data Science Analytics for Financial Services. You can read more about Capital One’s story (Clemons & Thatcher, 1998; McNamee 2001).
  • The original team selected was as follows and all have contributed inputs:
    • Dennis Wisnosky and Mike Bennett, EDMC (Remote)
    • Randy Coleman, Wizdom Systems (Local-accepted)
    • Elisa Kendall, Thematix (Remote)
    • Brand Niemann, Semantic Community (Organizing and Leading)
    • Dean Allemang, Working Ontologist (Remote)
    • Aaron Bossett, YarcData  (Local - accepted)
    • Kees van Mansom, Be Informed (Reviewing-to be replaced)
    • More to be added if needed

END

The Cambridge Semantic Web Meetup Group

Date | Number | Title | Presenter | Description | Bio | Comments | URL
April 8, 2014 | 62 | From Data-analytics to Data-applications | David King, Founder and CEO of Exaptive, Inc. | How Applying Linked-Data Principles to Modular Software Development Enable a New Generation of Collaborative Applications | Over 15 years of experience in all aspects of software development, from large-scale database design to user interfaces to the management of software development teams. Mr. King has a BS in electrical engineering and computer science from MIT and helped to pioneer paperless manufacturing information systems during his 12 years at SynQor Inc, where he was Chief Software Architect and IT Director. | 6 | Ambiguous Words
March 11, 2014 | 42 | Building Social Applications with the Linked Data Platform | Sandro Hawke (W3C/MIT), Andrei Sambra (MIT), Joe Presbrey (MIT) | The Linked Data Platform (LDP) is an emerging set of standards allowing web servers to provide efficient, shared, generic (RDF) application data storage. By using ordinary HTTP operations with LDP features, applications can use the web as their backend database. | Sandro has been a member of W3C's Semantic Web staff since the early days and helped standardize RDF, OWL, RIF, SPARQL, and other related technologies. Andrei has recently obtained his PhD on the topic of decentralized identity and authentication, while paying special attention to the privacy aspects involved. Joe has worked on several free and open source software projects. | 2 | Cimba
February 11, 2014 | 30 | Using Semantic Web technology to detect insider trading. | Marty Loughlin, Cambridge Semantics | Marty Loughlin will discuss use cases for SemWeb tech in the financial services and give a demo of using the Anzo software for insider trading surveillance. Oliver Ruebenacker, PanGenX: Personalized Medicine: Linking Pre-Clinical and Clinical Data. -- CANCELED DUE TO LAST MINUTE TECHNICAL DIFFICULTIES | | 0 |
January 14, 2014 | 50 | The Semantic Web Landscape | Lee Feigenbaum, Cambridge Semantics | In this talk, Lee Feigenbaum will present an introductory overview of the Semantic Web, its technologies, and how they are applied both on the Web and in enterprises. | Lee Feigenbaum is a co-founder of Cambridge Semantics and has been active in the Semantic Web community for ten years. Lee co-chaired the SPARQL Working Group and is a creator and editor-in-chief of Semantic University. Lee also wrote about "The Semantic Web in Action" in a 2007 _Scientific American_ article. | 3 | Federal Big Data Working Group Meetup
December 10, 2013 | 39 | More, Better, Faster: Enabling Graph Analytics at Scale | Tom Bourgoin, Segment Director, Life Sciences | An introduction to YarcData, a subsidiary of Cray. YarcData's Urika appliance is a supercomputer specially designed for working with big data graphs. | | 1 |
November 12, 2013 | 51 | Linked TCGA: A clinically relevant 30 billion triple Dataset | Helena Deus, Senior Scientist and Medical Knowledge Engineer at Foundation Medicine | The Cancer Genome Atlas (TCGA) is a multidisciplinary, multi-institutional effort to catalogue genetic mutations responsible for cancer using genome analysis techniques. It consists of over 400Tb of valuable clinically actionable data which doubles in size every 9 months. In this talk I will describe how we built TopFed, our robust and distributed solution for supporting clinically driven SPARQL queries. | | 7 | Slideshare; SD quality; HD quality; Live Demo Link; Semantic Medline
October 8, 2013 | 49 | Application of Semantic Web Technologies in the development of a Comparison Shopping Engine | Eric Gieseke | The project leverages the GoodRelations e-Commerce ontology, Sesame, Alibaba, SPARQL, and a UI that dynamically modifies its available search criteria based on the properties of the products within context. | Eric Gieseke is a Software Architect for a leading payment solutions company and teaches Software Design at the Harvard Extension School. | 0 |
September 10, 2013 | 35 | Validating RDF Input | Eric Prud'hommeaux, W3C | The RDF Validation workshop (10-11 Sept) will collect use cases and technical solutions to the needs for RDF service interface definition/verification. Eric will present the use cases gathered on day one, as well as those he has gathered from medical informatics, and present a specialized language called ShEx (Shape Expressions) which addresses some or most of those requirements. | | 3 | ShEx; yacc; parser generators
August 13, 2013 | 59 | The RDF Pipeline Framework: Automating Distributed, Dependency-Driven Data Pipelines | David Booth, Independent Consultant | This presentation explains how distributed data production processes can be conveniently described in RDF as executable dependency graphs, using the RDF Pipeline Framework. A live demo of a simple data pipeline will be included. The RDF Pipeline Framework is open source software available under an Apache 2.0 license. | | 9 |
July 9, 2013 | 47 | The BEL Framework and RDF | Tony Bargnesi, Selventa | The BEL Framework is an open-source knowledge management tool for biological knowledge captured in BEL (Biological Expression Language). BEL and the BEL Framework together are known as OpenBEL. The framework is being reimagined with the help of RDF, graph databases, and a scalable architecture. In this talk we will discuss the current goals, architecture, and RDF model ideas. A prototype demonstration is planned. | | 6 | Presentation
June 11, 2013 | 56 | State-of-the-art In Semantic Interpretation of Health Care Data | Eric Prud'hommeaux, W3C | Initiatives like the Affordable Care Act in the US and Electronic Health Records for Clinical Research in Europe seek to get more use out of your clinical data for drug research, machine-assisted medication interaction and adverse event detection, efficacy evaluation, and even simple patient portability. Recent work by HL7 effectively defines the RDF representation for the clinical data exchange format (CCDA) mandated by the US government. | Eric Prud'hommeaux is a sanitation engineer employed by the World Wide Web Consortium (W3C) at MIT. | 11 | Slides
May 14, 2013 | 70 | Open Annotation Core Data Model | Dr. Paolo Ciccarese, Massachusetts General Hospital and Harvard Medical School | The Open Annotation Core Data Model produced by the Open Annotation Community Group specifies an interoperable framework for creating associations between related resources, called annotations, using a methodology that conforms to the Architecture of the World Wide Web. | I am a senior scientist and software engineer, working in the bio-medical informatics field since the year 2000. My extensive background in software engineering, knowledge management and biomedical informatics. | 14 | Part 1; Part 2; Part 3; Code Example; Semantic University
April 9, 2013 | 45 | SemWeb research at Rensselaer (RPI); New features in RDF 1.1 | Jim Hendler, RPI; Sandro Hawke, W3C | A lightning talk on SemWeb research at Rensselaer (RPI), then take questions. Earlier in the day he's giving a longer presentation; New features in RDF 1.1, which is being finalized in the next few months. New features include JSON-LD, Turtle (finally being a standard), Trig/Named Graphs, and some improvements in how Literals are defined. | Jim Hendler is the Tetherless World Professor of Computer and Cognitive Science, and the Head of the Computer Science Department at Rensselaer; Sandro Hawke is a software developer specializing in decentralized systems based on open Web technologies. | 14 | Slides; Slides
April 9, 2013 | 30 | Broad Data: What happens when the Web of Data becomes real? | Jim Hendler, Tetherless World Professor of Computer and Cognitive Science, Rensselaer Polytechnic Institute, USA | In this talk, we present the broad data challenge and discuss potential starting points for solutions. We illustrate these approaches using data from a "meta-catalog" of over 1,000,000 open datasets that have been collected from about two hundred governments from around the world. | Hendler is the first computer scientist to serve on the Board of Reviewing Editors for Science. Hendler was named one of the 20 most innovative professors in America by Playboy magazine and was selected as an “Internet Web Expert” by the US government. | 5 | Slides
March 12, 2013 | 51 | Venture Capital and Entrepreneurship Outlook for Semantic Web in 2013: A Panel Discussion | Richard Dale -- Managing Director, Big Data Boston Ventures, David de Graaf -- CEO, Selventa, Anthony Philippakis -- Venture Partner, Google Ventures, and David Steinberg -- PureTech Ventures | We have invited experts from venture capital and from emerging industry to share their perspectives on opportunities and trends for Semantic Web in 2013 and beyond. Please join us to hear what they have to say, and to join in the interactive conversation. | | 9 |
February 12, 2013 | 73 | Leveraging Semantics for Text Mining and Text Analytics; MGH: WebID module for Drupal | Richard Mallah, Cambridge Semantics; Stéphane Corlosquet, MIND Informatics | Linked Data, OWL, and semantic relationships can significantly help inform the natural language processing algorithms, and can also be leveraged to link annotations or extractions from multiple different NLP engines in meaningful ways. Semantic links to structured and semistructured data also allow these higher-level unstructured extractions to then be used operationally and in context. Join us as we show how Cambridge Semantics' Anzo Unstructured unlocks this potential; WebID is an open standard for identity and password-less login on the Web. WebID is designed to help alleviate the difficulty that remembering different logins, passwords and settings for websites has created. It is also designed to provide a universal and extensible mechanism to express public and private information about identity online. | | 10 | Slides; Drupal; Scor
January 8, 2013 | 52 | Demos from "Linked Data" - An upcoming book for Web developers. | David Wood, Three Round Stones | This talk will demonstrate two code examples from the book: A Friend-of-a-Friend (FOAF) generator in JavaScript and an example of using Linked Open Data to create a weather reporting service for any ZIP code. | David Wood is co-chair of the W3C RDF Core working group, and CTO of Three Round Stones | 6 | Slides

END

Eighth Meetup: Tuesday, May 6, 2014, 6:30 p.m.

EPA/NASA Climate-Environmental Data Analytics & A Redesigned, Open Data.gov

Jeanne Holm, Evangelist, http://Data.gov ; Chief Knowledge Architect at NASA/JPL; UCLA Teacher and Alum
Pasadena, CA · data.gov @JeanneHolm

https://twitter.com/JeanneHolm

Joan L. Aron, Ph.D.

joanaron@ymail.com

Profile

Dr. Aron is highly experienced in building U.S. and international partnerships in global environmental/climate change that engage scientists, social scientists, policy-makers and the public health community. She has expertise in and an interdisciplinary understanding of the applications of Earth observations, including linkages to decision-making and capacity-building.  She has a diverse background in climate change policy, science, assessments, impacts, vulnerability, adaptation/mitigation and sustainable landscapes.  She has extensive relationships with the science and technology policy communities in U.S. and international agencies.  She has excellent written and oral communication skills developed in a variety of work settings (academia, nonprofit organization, government and private sector).  She also has experience in the coordination of professional development workshops. Her background is in global environmental/climate change, public health, ecology, applied mathematics, and information systems.

Selected publications

Aron JL, Hall RK, Philbin MJ, Schafer RJ. 2013.  Managing sustainable natural resources in support of water quality.   
WaterWiki (Nov. 26, 2013)
http://www.iwawaterwiki.org/xwiki/bin/view/Blog/Managing+Sustainable+Natural+Resources+in+Support+of+Water+Quality

Aron JL, Hall RK, Philbin MJ, Schafer RJ. 2013.  Using watershed function as the leading indicator for water quality.  Water Policy 15: 850-858.

Aron JL, Hall RK, Philbin MJ, Schafer RJ, Swanson S. 2012. Assessing ecological functionality in watersheds [Restoring ecological functionality in watersheds to control erosion and siltation: reducing nonpoint land-based sources of marine pollution in the wider Caribbean] (First Meeting of the Scientific, Technical and Advisory Committee for the Land-based Sources Protocol, Oranjestad, Aruba, 5 – 7 June 2012). http://www.cep.unep.org/meetings-events/1st-lbs-stac [Scroll down to Day 2 of Presentations] 

Aron JL, Corvalán CF, Philippeaux H, eds. 2003. Climate Variability and Change and their Health Effects in the Caribbean: Information for Adaptation Planning in the Health Sector. Conference May 21-22, 2002, Workshop May 23-25, 2002, St. Philip, Barbados, West Indies. World Health Organization, Geneva. ISBN 92 4 159071 8 (English)

Aron JL, Corvalán CF, Philippeaux H, eds. 2003. Variabilidad y cambio climático y sus efectos sobre la salud en el Caribe: Información para planificar la adaptación en el sector salud. Conferencia 21-22 de mayo de 2002, Taller 23-25 de mayo de 2002, St. Philip, Barbados, West Indies. Organización Mundial de la Salud, Ginebra. ISBN 92 4 359071 5 (español)

Aron JL, Patz JA (eds.) 2001.  Ecosystem Change and Public Health: A Global Perspective, Johns Hopkins University Press, Baltimore, Maryland.

Buck AA, Aron JL.  Epidemiological study designs (Chapter 2)

Aron JL, Glass GE. Geographic information systems (Chapter 3)

Aron JL, Ellis JH, Hobbs BF.  Integrated assessment (Chapter 5)

Aron JL, Shiff CJ, Buck AA.  Malaria and global ecosystem change (Chapter 12)

Roberts L, Confalonieri UEC, Aron JL.  Too little, too much: how the quantity of water affects human health (Chapter 14)

END

Ninth Meetup: Tuesday, May 20, 2014, 6:30 p.m.

Data Science at GMU and Elsevier Research Data Services

  • 6:30 p.m. Welcome and Introduction Slides
  • 6:35 p.m. Continue Data Science Tutorial: Practical Data Science for Data Scientists, Data Science Students and Careers. See Professor Dr. Kirk Borne of George Mason University Slides. Graduate Students Working on Semantic Medline-YarcData Projects: GMU Updates Master's Program for Data Science and Sarah Soliman, Rand, and IV MOOC Student Project (invited-rescheduled to June)
  • 7:00 p.m. Brief Member Introductions
  • 7:10 p.m. Big Data - Forward - Backward, Charles Randall Howard, Adjunct Professor in the Applied IT Department and Sr. Data Scientist at Novetta Solutions. Slides
    • Professor Howard has a Ph.D. in Information Technology from George Mason University, a M.S. in Information Systems from Virginia Commonwealth University, and a B.S. in Information Systems from Virginia Commonwealth University.  As a Sr. Data Scientist at Novetta Solutions, he guides Big Data Science initiatives to solve problems and seize opportunities towards enabling organizations in realizing Big Data benefits across their entire organization (vs. just a few data scientists). He focuses on bringing technology back to making businesses more efficient and effective in delivering results.  Previous experience includes Principal Data Scientist at Berico Technologies, Principal Semantic and Knowledge Scientist at Boeing-SMSI, Principal Consultant at SRA International, INC./Raba Technologies; Vice President of Engineering at Tech I2; and Principal Software Engineer at Raytheon.
  • 7:45 p.m. Stories that Persuade, Anita de Waard, VP Research Data Collaborations at Elsevier Research Data Services/University of Utrecht. Slides and Slide Share. Also see Looking for Data: Finding New Science and Ten Habits of Highly Effective Data
    • Anita de Waard has a background in experimental physics. She joined Elsevier as publisher in physics and neurology in 1988, and since 1997 she has been employed as a Principal Researcher for Disruptive Technologies in the Labs group. Her main focus is the development of innovative product concepts, with a specific interest in establishing collaborations between Elsevier and academic groups in information and computer science. In 2003, Anita founded and ran the Reed-Elsevier Data Standards Group. Her interests include the application of Semantic Web technologies for scientific communication, and the development of a new, semantic form for the scientific article.
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Please Note Suggestions for Future Meetups:

Tenth Meetup: Monday, June 2, 2014, 6:30 p.m.

Ontology Summit 2014 Postmortem and Reading & Reasoning with Semantic Insights for the DARPA Big Mechanism

Ontology Summit 2014 Postmortem, May 15, 2014:

I have followed up with requests for additional information on our work from all four keynote speakers as follows: 

George Strawn also challenged us to find another (Semantic Medline on YarcData being the first) best practice example of “Big Data and Semantic Web Meet Applied Ontology”, or as we like to say “Big Data with Semantic Web and Applied Ontology”. The one we selected is the new Data Science for Climate Change Impacts in the United States Report and Web Site, which also happens to be for his boss John Holdren, the President’s Science Advisor!
 
Data Science for Climate Change Impacts This is certainly Big Data and use of the Semantic Web and Applied Ontology which we are building on to make this a Data Publication in a Data Browser.
 
This work is the subject of future Meetups and we hope there are future Applied Ontology Summits on this subject: “Big Data with Semantic Web and Applied Ontology”

Chuck Rehberg: As CTO at Trigent Software and Chief Scientist at Semantic Insights™, Chuck has led the development of mission-critical systems in use today at Navistar International Corporation (NYSE: NAV); developed and patented the “Fast Rules Selection Engine” (FRSE), the world’s fastest and most scalable rules engine; and led due diligence for mergers and acquisitions and directed the subsequent integration of personnel and technology. Chuck has developed several enabling technologies (held as trade secrets) to allow improved reliability, performance, and delivery times while increasing productivity and minimizing cost. With over thirty years in the high-tech industry, developing leading-edge solutions in the areas of Artificial Intelligence and AI-based configuration software, Chuck holds several patents in cutting-edge natural language processing technologies powering a new generation of semantic-based solutions. Semantic Insights™ is the R&D division of Trigent Software, Inc. NSF BigData Proposal

Future Meetups in July - September:

Eleventh Meetup: Monday, June 30, 2014, 6:30 p.m.

MIT Big Data Initiative: Sam Madden, & Current Elephants: Michael Stonebraker

Background: See Workshops on Extremely Large Databases, EarthCube Data Science Publications, and EarthCube Special Interest Group

Twelfth Meetup: Monday, July 7, 2014, 6:30 p.m.

Data Science of White House Big Data Review and Brooke Aker: Big Data Lens on OpenFDA

See: Slide 2 Big Data in Space and Earth Sciences, Slide 1 Big Data and NITRD, and Slide 2 Effectively Exploiting Big Data with Semantics: A Pilot Project

  • Background: See Data Science for the Big Data Review, Data Science for OpenFDA, and NSF Strategic Plan
  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides
  • 6:45 p.m. Mary Galvin, AIC, HPCC Systems Academic Program and the Georgetown University McCourt School of Public Policy’s Massive Data Institute Slides
  • 7:10 p.m. Brief Member Introductions
  • 7:15 p.m. Katherine Goodier, Excelerate Solutions, Legislative Data and Transparency Conference and Slides and Excel; and Chuck Rehberg, Semantic Insights, SIRA Part II Slides
  • 7:45 p.m. Brooke Aker, Big Data Lens: A Look at OpenFDA API and Big Data Design(s) Based on It. We’ll take a look at the OpenFDA API that came out last month and navigate some of the data, queries and results you can get from this important new Big Data source. We’ll also look over some Big Data design work based on melding OpenFDA data with other federal data sources. Slides (in line) and Slides (PowerPoint)
    • Background
      • Working on data analytics since 1987 when I did my first regression analysis on surplus government cheese !! Now working on healthcare and security predictive analytics and machine learning.
    • Networking and Agenda with Announcements, Presentations, Training, and Demos
      • Here is a nice method to use if you are seeking to understand new technology, its applicability and readiness for use. It is also emblematic of good Big Data practice - turning a large, free information resource into something valuable with simple straightforward thinking and driven by sophisticated software. Enjoy. http://www.bigdatalens.com/blog/2014...ta-methodology
    • Participation in other Meetups
      • Lots of other Big Data Meetup Groups. Was at the Data Salon in Cambridge Mass last night !!
    • Do you live near a DC Metro Station and use Skype?
      • Use Skype for sure
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

September - December: Once a month
Silver Line Spring Hill Metro Station Opens July 26th

Thirteenth Meetup: Monday, July 28, 2014, 6:30 p.m.

Silver Line Metro For OMB Ontology Memo from Multiple Experts (Blevins, Morosoff, & Pohl)

Background: OMB Ontology and Ontologizing Memo - In Summary:
An Ontology:

  • is a formal representation of meaning in an information system;
  • creates the bridge between the internal world of the computer and the external world of people’s understanding;
  • provides an interlingua between disparate data sources and knowledge bases;
  • allows us to build useful and usable systems for complex tasks in health care.

Remember:

  • don’t try to divorce the Ontology from its application (the ‘universal ontology’)
  • building and embedding an Ontology in a useful application has pitfalls that require judgment, experience, clarity of purpose, and resources.
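
The "interlingua" idea in the summary above can be sketched in a few lines. This is a toy illustration only: the source names, field names, and shared terms are all hypothetical, and a real ontology would carry far more (types, relationships, constraints) than a simple term mapping.

```python
# Shared vocabulary that plays the role of the interlingua (hypothetical terms).
ONTOLOGY_TERMS = {"patient_id", "systolic_bp_mmhg"}

# Each source system maps its own field names onto the shared terms.
SOURCE_MAPPINGS = {
    "clinic_a": {"pid": "patient_id", "sbp": "systolic_bp_mmhg"},
    "hospital_b": {"PatientNumber": "patient_id", "SystolicPressure": "systolic_bp_mmhg"},
}


def to_shared_terms(source, record):
    """Translate one source record into the shared vocabulary, dropping unmapped fields."""
    mapping = SOURCE_MAPPINGS[source]
    return {mapping[key]: value for key, value in record.items() if key in mapping}


record_a = to_shared_terms("clinic_a", {"pid": "P-17", "sbp": 128, "notes": "n/a"})
record_b = to_shared_terms("hospital_b", {"PatientNumber": "P-17", "SystolicPressure": 128})
print(record_a == record_b)
```

Once both records are expressed in the same terms they can be compared or merged directly, which is the practical payoff of tying the ontology to a concrete application rather than aiming for a universal one.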

Also see: Intelligent Information Management Tools in a Service-Oriented Software Environment (Pohl) and E-MAPS on Ontology and Big Data

Agenda:

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides
  • 6:45 p.m. Peter Morosoff, President, E-MAPS, Inc. See wiki version at E-MAPS on Ontology and Big Data Slides
  • 7:00 p.m. Jens G. Pohl, PhD, Professor of Architecture, Emeritus, California Polytechnic State University, Senior Director, Adaptive Systems, Tapestry Solutions (a Boeing Company). Paper: Intelligent Information Management Tools in a Service-Oriented Software Environment. Note: See wiki version at Intelligent Information Management Tools in a Service-Oriented Software Environment Slides
  • 7:30 p.m. Brief Member Introductions
  • 7:45 p.m.​ David Blevins, Staff Engineer, Booz Allen Hamilton. Currently supports Life Sciences research performed by the Federal Government. Ontologies in Medical Care and Integration/Reuse Challenges at the Clinical and Enterprise Level Slides​
  • 8:15 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Peter Morosoff, President, E-MAPS, Inc., 10340 Democracy Lane, Suite 302, Fairfax, VA 22030

Jens G. Pohl, PhD, Professor of Architecture, Emeritus, California Polytechnic State University,  Senior Director, Adaptive Systems, Tapestry Solutions (a Boeing Company), San Luis Obispo, CA 93401

David Blevins, Staff Engineer, Booz Allen Hamilton. Currently supports Life Sciences research performed by the Federal Government. Prior to joining Booz Allen Hamilton, Mr. Blevins worked as an interface analyst for Etransmedia, where he acted as the vendor technical lead supporting the Allscripts products offered under the University of Pittsburgh Medical Center (UPMC) Electronic Health Solutions initiative.

To Be Rescheduled

Mike Cummens, MD: I am in the late stages of putting together a startup to provide pharma, academic medical centers, and IDNs with a data integration platform including semantic interoperability, cellular level security, and full lifecycle data lineage management. 2006: Slides and Word

Fourteenth Meetup August 4, 2014, 9:00 a.m.

COM.BigData 2014: The 1st International Summit on Big Data Computing

http://www.meetup.com/Federal-Big-Da...nts/197717282/

Keynote and Panel

50% off discounts to Meetup group members - offer has expired

The Federal Big Data Working Group Panel

The Federal Big Data Working Group Panel will feature these members who will discuss their involvement in the working group, research applications, use cases, publications, solutions, and technologies:

  • Dr. Katherine Goodier, Xcelerate Solutions (Panel Chair) Slides
  • Dr. Kirk Borne, George Mason University Slides
  • Dr. Chuck Rehberg, Trigent and Semantic Insights Slides and Slides
  • Dr. Joan Aron, Independent Consultant Climate Data Slides
  • Dr. Tom Rindflesch, National Institutes of Health Slides
  • Ms. Mary Galvin, Managing Principal, AIC & Semantic Community Slides

Fifteenth Meetup, Monday, August 11, 2014, 6:30 p.m.

SilverLine Metro More Ontology Experts (Baclawski, Guerino, Morosoff, & Goodier)

Recall from Seventh Meetup, April 15, 2014:

Sixteenth Meetup, Monday, September 8, 2014, 6:30 p.m.

Proposed: Joint Meetup with OSTP FASTER CoP: NSF for Data Scientists, Data Infrastructure, and Data Publication

Note: We will come back to this and are on to the Astronomy pilot to be presented on September 10th and October 9th now.

Background: See Story and Slides

  • 6:30 p.m. Welcome and Introduction, FASTER Co-chairs, Robert Chadduck (NSF), and Robert Bohn (NIST)
  • 6:45 p.m. Big Data and the NITRD: NSF Strategic Plan for Big Data and Open Research Data Publications, Dr. George Strawn, NITRD Director Slides
  • 7:10 p.m. Brief Member Introductions
  • 7:15 p.m. NSF Strategic Plan Knowledge Base, Brand Niemann, Federal Big Data Working Group Meetup Slides
  • 7:45 p.m. Finding Funding for Research Topics on NSF Website (a simple example of “finding”), 04-AUG-2014, Chuck Rehberg, CTO, Semantic Insights™ a Division of Trigent Software, Inc. Slides
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Seventeenth Meetup, Monday, September 22, 2014, 6:30 p.m.

Meetup.com: NITRD FASTER CoP Meetings & Inter-American Development Bank Open Data Portal

  • 6:30 p.m. Welcome and Introduction - New Tutorial and Mentoring on Recent NITRD FASTER CoP Meetings, Data Science and Analytics in Government Meetup, Current and Upcoming Activities, and Big Data, Analytics, and Data Science. Slides. (See Story and Slides for September 8th Meetup Not Held)
  • 7:10 p.m. Brief Member Introductions
  • 7:15 p.m. Finding Funding for Research Topics on NSF Website (a simple example of “finding”), 04-AUG-2014, Chuck Rehberg, CTO, Semantic Insights™ a Division of Trigent Software, Inc. Slides
  • 7:45 p.m. Annette Hester, Project Coordinator, Energy Innovation Center, Infrastructure and Environment Sector, Inter-American Development Bank
    • Annette Hester has developed the Open Data portal for the Inter-American Development Bank. Her portal allows international development users to export numerous categories of data in JSON, RDF, and CSV formats.  She is exploring more use cases and opportunities for Linked Data to inform decisions in government and business.
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Eighteenth Meetup, Monday, October 6, 2014, 6:30 p.m.

FDA Data Innovation Lab and Predictive Analytics Meetup

Nineteenth Meetup, Thursday, October 9, 2014, 7:30 a.m.

Conference for NSF Data Scientists, Data Infrastructure, and Data Publication

http://www.digitalgovernment.com/Age...ce--Expo.shtml

What: Government Big Data Conference & Expo
When: October 9, 2014
Where: The Pavilion, Ronald Reagan Building, 1300 Pennsylvania Ave., NW, Washington, DC

Executive Breakfast by Invitation (PDF)

Our session is entitled "Challenges and Solutions for Big Data in the Public Sector," and is scheduled for 11:15AM-12:00PM on October 9 (including remarks and Q&A).

DGI is willing to offer FBDWG members a 40% discount on registration fees for the Government Big Data Conference on October 9th. Please note – registration fees for government, press, and academia are complimentary.

We will develop a registration link specifically designed for FBDWG that you can share at the Meet Up on Monday.

Thanks again for your participation – it is appreciated. Mike Smoyer
President, Digital Government Institute

Credential requirements for editorial press registration

Source: http://www.digitalgovernment.com/med...ile88_2515.pdf:

When registering as a member of the Press for any Digital Government Institute Conference, qualified press are required to provide one of the following if requested:

  • Masthead from a current issue of an industry-related publication listing you as an editorial contributor
  • An industry-related article published within the past three months with your byline
  • A letter from the news director or assignment editor on station/network letterhead saying that you have been assigned to represent the organization in an editorial capacity at the noted DGI conference
  • Press photo ID
  • Business card from your media outlet reflecting your editorial role
  • Online publications must also provide the demographics and/or an audited statement verifying the site gets 1,000 or more unique visitors a month. The Web publication must be a previously established, independent site that is regularly updated with original and current broadcast or media-related news. Personal blogs, forums and user groups will not be considered for media registration.

Questions? Please contact us at info@digitalgovernment.com.

Twentieth Meetup, Monday, November 3, 2014, 6:30 p.m.

Data Science for NOAA Chief Data Officer and Big Data Predictive Analytics

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides
  • 7:00 p.m. Brief Member Introductions
  • 7:10 p.m. Treeminer.com Video, Mark Silverman and Biplab Pal
  • 7:30 p.m. Predictive Analytics in the Era of Big Data, Dave Vennergrund, Director, Data Analytics Center of Excellence, SalientFed Slides
    • I will discuss advanced analytic opportunities opened by the confluence of ever-expanding machine learning libraries and algorithms, unprecedented amounts of data, and the distributed computing platforms that support at-scale analysis of both relational and Big Data (HDFS) data structures. In addition, I will share lessons learned from past predictive analytic efforts in federal healthcare, intelligence, finance, personnel, benefits, improper payment, fraud detection, and tax analytics.
    • I am responsible for leading and expanding innovations and best practices for the Salient Data Analytics Center of Excellence in data science and predictive analytics. I have over 25 years of Federal R&D, IT management, solution development, and research experience. I have led dozens of successful business intelligence, predictive analytics, and data mining-based projects across the Federal government including budget forecasting for HUD and DOD; improper payment prevention for IRS, USDA, CMS, VA, DFAS, and OPM; and predictive modeling for DOI, EPA, USAID, and VA. I was the Director of CACI's (formerly Delta Solutions and Technologies, Inc.) Business Analytics Practice and built a service line that offered advanced business intelligence, big data solutions, data mining analytics, fraud detection, and predictive analytics. I spent over 14 years with SRA International, Inc., where I founded a data mining center of excellence and a data warehousing practice. Recent publications at Predictive Analytic World Government 2011 (forecasting attrition at DOI) and 2012 (forecasting HUD housing budgets); Medicare and Medicaid Statistics and Data Analysis Conference 2011 (Fraud Detection method survey) and 2012 (Profiling providers with Big data methods). Master's in Computer Science (Artificial Intelligence) from Arizona State University 1986. Co-Chair KDD-2003.

  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Rescheduled to January 5th or February 2nd, 2015: Wolfram Alpha, Data Science Platform, Discovery Platform, Language, and Data Summit 2014.

Rescheduled to January 5th or February 2nd, 2015: Big Data Science for DHS, Qasim Hussain, COO, Greenzone Solutions, Inc.

Recent Presentation and Slides: Stephen Dennis, Director, Innovation, Science and Technology Directorate, Department of Homeland Security, Big Data Analytics and Homeland Security

My Notes:

I don’t know what this “data” stuff is, but I want some of it…

DHS S&T Mission: Strengthen America’s security and resiliency by providing knowledge products and innovative technology solutions for the Homeland Security Enterprise (HSE)

Superstorm Sandy (Initial Findings) from NUSTL My Note: FAIRport this!

Statement of Big Data Problem in DHS

S&T’s Big Data Survey: Goal is to improve operational effectiveness and efficiency within the Department and HSE

Continue to work cultural issues that tend to plague big data

FEMA: Improved Utilization of Data Sets My Note: I worked on this!

Leveraging Leading-edge Data Science Research My Note: That is what he asked me to show him!

Big Data Lessons Learned

Determine what data exists and how it can be manipulated to make it useful

End of My Notes

Semantic Insights Followup

Looking for interested individuals who wish to participate in our Natural Language Understanding and Reasoning research. We welcome educational institutions and individual researchers interested in working collaboratively with us. Accounts are available for beta test: http://www.semanticinsights.com/

Applying High-speed Pattern Recognition to Generate Queryable Semantics from Big Data - Big Data is filtered and reduced in real-time for event and pattern discovery: Applying High-speed Pattern Recognition to Generate Queryable Semantics from Big Data (PDF)

Why Is Statistical Literacy So Important?

Slides and Washington Statistical Society Newsletter about the new Stat Ed program committee
Elizabeth Johnson
Department of Statistics
George Mason University
Johnson Center 326, Meeting Room B
4400 University Drive, Fairfax, VA 22030
Time: 11:00 A.M. - 12:00 Noon
Date: Friday, October 10, 2014 

Abstract

While there are many definitions of statistical literacy, there is no disagreement about its importance for an educated society. Past and current presidents of the American Statistical Association (ASA) have played an important role in promoting both statistics education and statistical literacy. In 1950, Samuel Wilks stated that “Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write!” In the 1990’s Richard Scheaffer helped to create the Advanced Placement (AP) Statistics program and in-coming ASA president Jessica Utts will be the 2015 Chief Reader of the AP Statistics exam. In this presentation I will review the history of the AP Statistics program, the creation of the Guidelines for Assessment and Instruction in Statistics Education (GAISE) report and the Statistics and Probability concepts covered by the K-12 Common Core State Standards for Mathematics. I will discuss the impact these programs will have on undergraduate statistics curricula, teacher certification programs and future statisticians.

 
Delayed: Georgetown Massive Data Institute
Delayed: Michael Daconta, Build a Knowledge Base with my (experimental) software EzKb
  • Install the Easy Knowledge Base Editor (EzKb) Today!
  • For an online article I am writing for my Government Computer News column (called Reality Check), I am releasing an alpha release of my Java software called the Easy Knowledge Base Editor (or EzKb for short). I created a Windows installer and Windows executable that you can download here. If there is interest I will create a manual install with the JAR files for Linux and macOS installations.
  • There are some help files (but I need to create many more) and some YouTube videos on my YouTube channel to give you an introduction to the software and some of the things that it can do. Be warned that this is alpha software so it is not feature complete. It has integrated maps and an integrated WordNet dictionary.
  • The way to think about this software is that each tab represents a layer in your knowledge base, starting from the smallest layer (a single fact) to more complex layers like things (aka Entities or Classes), to a relationship editor (connect things to create relationships) to rules (if-then constructs) to triggers (when to execute rules) and many other items (like a .csv file import).  I have a grand vision for this software and frankly, not enough time to actually create what I envision.  So, I am releasing it in this alpha state and then will continue to improve it (as time permits).
  • Enjoy!
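
To make the layering described above more concrete (facts, things, relationships, rules, triggers), here is a minimal sketch in Python. It is not EzKb's actual design or API, which is Java software whose internals are not shown here; every class and method name below is hypothetical and chosen only to mirror the layers named in the description.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Thing:
    """A 'thing' (entity/class instance) holding single facts as key/value pairs."""
    name: str
    facts: Dict[str, object] = field(default_factory=dict)


@dataclass
class Rule:
    """An if-then construct: run `action` when `condition` holds for a thing."""
    name: str
    condition: Callable[[Thing], bool]
    action: Callable[[Thing], None]


@dataclass
class KnowledgeBase:
    things: Dict[str, Thing] = field(default_factory=dict)
    relationships: List[Tuple[str, str, str]] = field(default_factory=list)
    triggers: Dict[str, List[Rule]] = field(default_factory=dict)

    def add_thing(self, thing: Thing) -> None:
        self.things[thing.name] = thing

    def relate(self, subject: str, predicate: str, obj: str) -> None:
        # Connect two things to create a relationship.
        self.relationships.append((subject, predicate, obj))

    def on(self, event: str, rule: Rule) -> None:
        # A trigger says WHEN a rule should be evaluated.
        self.triggers.setdefault(event, []).append(rule)

    def set_fact(self, thing_name: str, key: str, value: object) -> List[str]:
        # Update a fact, then evaluate rules registered for "fact_changed".
        thing = self.things[thing_name]
        thing.facts[key] = value
        fired = []
        for rule in self.triggers.get("fact_changed", []):
            if rule.condition(thing):
                rule.action(thing)
                fired.append(rule.name)
        return fired


kb = KnowledgeBase()
kb.add_thing(Thing("Sensor1", {"temperature_c": 20}))
kb.add_thing(Thing("Room101"))
kb.relate("Sensor1", "located_in", "Room101")
kb.on("fact_changed", Rule(
    name="overheat_alert",
    condition=lambda t: t.facts.get("temperature_c", 0) > 30,
    action=lambda t: t.facts.__setitem__("alert", True),
))

fired_cool = kb.set_fact("Sensor1", "temperature_c", 25)
fired_hot = kb.set_fact("Sensor1", "temperature_c", 35)
print(fired_cool, fired_hot)
```

Keeping triggers separate from rules, as the description suggests, means the same if-then rule could be attached to different events without being rewritten.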
 
Tutorials
  • 16:35: Demonstration of Inferencing in EZKB by Michael Daconta
  • 5:14: Demonstration of Creating Triggers in EZKB by Michael Daconta
  • 7:42: Demonstration of Creating Classes and Inheritance in EZKB by Michael Daconta
  • 3:52: Demonstration of Alert Triggers and Rules in EZKB by Michael Daconta
  • 7:42: Intro to Facts in EZKB by Michael Daconta

Twenty-first Meetup, Thursday, November 13, 2014, 4:30 p.m.

Rescheduled to December 15th

First Virginia Big Data Meetup: Data Science for Tackling the Challenges of Big Data, December 15, 2014

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for Big Data Analytics
  • 7:10 p.m. Brief Member Introductions 
  • 7:15 p.m. DATA Act Requirements Thoughts and Open Discussion, Art Nicewick, Executive Consultant, CGI Federal Slides
  • 7:45 p.m. DATA Act at US Department of Treasury Slides
    • Web Sites: http://fedspendingtransparency.github.io/ and http://fedspendingtransparency.githu.../dataelements/ 
    • Questions: At this time, we are asking for comments in response to the following questions:
      • Which data elements are most crucial to your current reporting and/or analysis?
      • In setting standards, what are industry standards the Treasury and OMB should be considering?
      • What are some of the considerations that Treasury and OMB should take into account when establishing data standards?
  • 8:30 p.m. Open Discussion 
  • 8:45 p.m. Networking 
  • 9:00 p.m. Depart 

Automated Data Science Meetup is Cancelled-Nutonian Event Still Being Held

As the new invited organizer of the Virginia Big Data Meetup and founder and co-organizer of the Federal Big Data Working Group Meetup, I regret having to cancel the Automated Data Science Meetup starting at 4:30 p.m. this afternoon for the following reasons:

I have not received the agenda, bios and slides of the two presenters.

I have been previously warned by Meetup.com that I cannot use their services to advertise/promote vendors/commercial events.

I have requested that we follow our normal agenda, asked for the content to review, and communicated my concerns, but I have not received a response.

I am not cancelling the Nutonian event, which you are still free to attend, just the Meetup.com support for it.

Thank you for your support and understanding in this matter.

Brand Niemann

Automated Data Science

Data Science as a Service™ (DSaaS)

Possible Tutorial: Data Science for Data Mining

Source and Register: http://www.nutonian.com/landing/auto...ta-science-dc/

How could Automated Data Science™ change your business?

Overview

Nutonian is introducing Automated Data Science software: automatically analyzing all of your data to reveal the most important cause and effect relationships driving your business. Eureqa empowers businesses to take action by revealing what is happening, why it's happening, and how to optimize for the desired outcome.

Jeremy Funk, a former data scientist at Capital One, will highlight sample use cases from companies like Lowe's, NASA, Rio Tinto, and countless others, who are implementing powerful machine learning techniques, such as classification, by simply uploading a .csv and clicking "run". Following Jeremy, Mark Ettrich, founder of Big Data District, will demo a mobile application that will help customers understand data privacy concerns and give them real-time privacy scoring on their mobile devices.

Meet us at Bar Louie in Ashburn on Thursday, November 13. The event begins at 4:30 with networking/happy hour followed by a presentation at 5. Drinks and appetizers will be provided.

Presenters

Jeremy Funk is a former data scientist at Capital One. Jeremy is now a data scientist at Nutonian, where he helps customers implement new DSaaS technologies to address mission-critical challenges.

About Nutonian

Nutonian's Data Science as a Service offering, Eureqa®, enables industry leading organizations to solve their most challenging business problems. Founded in 2011 by two of the world's leading data scientists, Nutonian's solutions focus on simplifying data science and making it a core competency for any organization. Nutonian is Where Artificial Intelligence Meets Business Intelligence™.

Twenty-second Meetup, November 18-19, 2014, 8:30 a.m.

CANCELLED BY SPONSOR-TO BE RESCHEDULED

Symposium on Predictive Analytics For Defense and Government Slides

Sheraton Pentagon City
900 S. Orme Street, Arlington, VA

A unique forum where leading experts from Government, DoD, and industry will gather to examine the latest plans, programs, capabilities and challenges in Predictive Analytics for Defense and Government. 

Program Objectives: 

  • What are the latest DoD and Government plans for establishing a predictive analytics capability? 
  • What are the emerging high-value missions and applications for predictive analytics? 
  • What are the current and emerging technical capabilities in predictive analytics? What are the technical challenges? 
  • What are the emerging lessons learned? 
  • What are the latest guidelines for establishing the predictive analytics infrastructure, architecture and culture? 

Free to government and military and discounts to FBDWG Members

Dear Dr. Niemann: It is with great regret that I must inform you that the Program Committee has decided to cancel the Predictive Analytics Symposium, November 18-19, 2014, in Arlington, VA. We are disappointed, as, thanks to your participation and support, we clearly had an outstanding agenda planned. We can only surmise that budget cycles, unforeseen competition, and the timing so late in the year, had something to do with the audience not being larger than we anticipated. We felt it would be better to cancel the event at this time and possibly reschedule the program at a later date when we could provide a more suitable size audience for the outstanding level of speakers we had confirmed. We thank you for your support and sincerely apologize for any inconvenience. Best regards

Mark Mitchell
Technical Coordinator, AIE Predictive Analytics Symposium
541-484-4174

Twenty-third Meetup, Monday, December 1, 2014, 6:30 p.m.

Data Science, Data Infrastructure, & Data Publications for the HHS IDEA Lab

  • 6:30 p.m. Welcome and Introduction – Report on Recent HHS IDEA Lab Demo Meeting with Bryan Sivak (invited) and Damon Davis (invited) and HHS Data Science Data Publication Tutorial Slides Background Data Science for Tackling the Challenges of Big Data (MIT Online Course)
  • 7:00-7:15 p.m. Joe Pringle, Director of Health, Socrata Slides and Demo Links:
  • 7:15 p.m. Brief Member Introductions and Refreshment Break
  • 7:30 p.m. Alex Sherman and Kartik Verma, Deloitte Consulting for HHS NIH and DoD MHS, Slides and Demo Link: http://semoss.org/
    • GINAS: Advancing FDA's Ingredient Information System, Noel Southall, National Institutes of Health (also FDA involved) (invited) 
      • FDA has articulated its vision for a next-generation data system that serves as the central clearing house for ingredients in medical products. Meanwhile, the National Center for Advancing Translational Science at NIH has created its own substance tracking system to facilitate research efforts. Working with the FDA, this NIH team will test their software as a solution in the FDA environment.
    • Fostering Scientific Insight through Data Federation, Brock Smith, National Institutes of Health (invited) 
      • This cross-departmental team consisting of individuals representing NIH, FDA and CDC recognizes a problem affecting scientists and their research goals. Because of the breadth and variety of resources, NIH researchers have difficulty synthesizing existing public data with their internally produced research findings and thus can easily lose valuable scientific insight. The team is testing the value of a web platform called SEMOSS that is designed to aggregate existing, fragmented health data while leveraging data analytic and visualization tools to enable scientists’ intuitive analysis and synthesis in their research.
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Note: We will be doing data science on the data sets being used in the NIH/FDA GINAS and NIH/FDA SEMOSS Data Federation and Analytics in preparation for this December 1st meetup and all are welcome to participate.

Twenty-fourth Meetup, Monday, January 12, 2015, 6:30 p.m.

Data Science for NSF Polar Cyberinfrastructure and MIT Big Data Course

Treeminer has just released an auto-clustering engine using p-Tree algebra & vertical mining. See Video at URL: https://www.youtube.com/watch?v=5X65WV0n4rU

They will give a live demo or run this video at our Meetup.

Autoclustering has tremendous application in all the Government agencies as they need to organize their documents without any supervised learning.

We welcome more live/video demos of big data applications for government.

Government Technology & Innovation Incubator for Big Data Analytics, January 27, 2015, 6:30 p.m.

Federal Big Data Working Group Meetup

Virginia Big Data Meetup

Lotico Northern Virginia Semantic Web

Purpose: Time Critical Because of Eastern Foundry Small Business/Start-Up Benefits and Challenge Cup Opportunity

Opportunity: Challenge Cup 

Companies with government technology products (physical and software) can compete for free space at Eastern Foundry, educational programming on government contracting, product development, and general corporate skills, and access to VCs.  Companies will be selected based on the maturity of their product and the urgency of the need the product would address.

Eastern Foundry: Veteran owned incubator for startups and small businesses interested in the government contracting industry.

Sen. Mark Warner cuts ribbon at Eastern Foundry opening ceremony

The Crystal City incubator opened its doors December 1st and already has 33 companies in their space.

http://technical.ly/dc/2014/12/16/sen-mark-warner-cuts-ribbon-eastern-foundry-opening-ceremony/

Web: Eastern Foundry (http://www.eastern-foundry.com and http://twitter.com/easternfoundry) 

Logistics: Free Parking After 5 p.m. in Underground Garage

Metro Blue and Yellow Lines: Crystal City Station

202-725-7483 and info@eastern-foundry.com

2011 Crystal Drive, 10th Floor, Arlington, VA 22202

Agenda

Twenty-fifth Meetup, Monday, February 2, 2015, 6:30 p.m.

Data Science for the National Big Data R and D Initiative

Data Science for MyFamilySearch.org and FamilyTree DNA, February 16, 2015, 6:30 p.m.

IMPORTANT NOTICE: This is remote only due to 100% chance of heavy snow forecast.

Data Science for MyFamilySearch.org and FamilyTree DNA

Twenty-sixth Meetup, Monday, March 2, 2015, 6:30 p.m.

Data Science for Big Data Application and Analytics MOOC

USDA CIO and ACDO on Open Data Plan and Roundtable, March 16, 2015, 6:30 p.m.

USDA CIO and ACDO on Open Data Plan and Roundtable

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science ESIP Publication and Data Science for FDA RFI (April 6th Meetup) using Spotfire Recommendations for Analytic Data Publications
  • 7:00 p.m. Brief Member Introductions
  • 7:15 p.m. Shobhit Chugh, Tamr, Presentation and Demo Slides
  • 7:30 p.m. USDA CIO,  Joyce Hunter, and Acting Chief Data Officer, Bobby Jones, on Open Data Plan and Roundtable Slides
  • 8:30 p.m. Open Discussion​
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Government Technology & Innovation Incubator for Big Data Analytics II, March 25, 2015, 6:30 p.m.

Government Technology & Innovation Incubator for Big Data Analytics II

Purpose: Time Critical Because of Eastern Foundry Small Business/Start-Up Benefits and Challenge Cup Opportunity

Opportunity: Challenge Cup 

Companies with government technology products (physical and software) can compete for free space at Eastern Foundry, educational programming on government contracting, product development, and general corporate skills, and access to VCs.  Companies will be selected based on the maturity of their product and the urgency of the need the product would address.

Eastern Foundry: Veteran owned incubator for startups and small businesses interested in the government contracting industry.

Sen. Mark Warner cuts ribbon at Eastern Foundry opening ceremony

The Crystal City incubator opened its doors December 1st and already has 33 companies in their space.

http://technical.ly/dc/2014/12/16/sen-mark-warner-cuts-ribbon-eastern-foundry-opening-ceremony/

Web: Eastern Foundry (http://www.eastern-foundry.com and http://twitter.com/easternfoundry) 

Logistics: Free Parking After 5 p.m. in Underground Garage

Metro Blue and Yellow Lines: Crystal City Station

202-725-7483 and info@eastern-foundry.com

2011 Crystal Drive, 4th Floor, Arlington, VA 22202

Agenda:

  • 6:30 p.m. Welcome and Introduction (Preview of Proposed DATA Act Elements, Standardized Formulas, and Agency Implementation Challenges)
  • 6:45 p.m. Brief Member Introductions 
  • 7:00 p.m. Chris Garner, Paxata, Inc., Presentation and Demo Slides
  • 7:20 p.m. Steve Hamer, Gov PATH Solution, Presentation and Demo 
  • 7:40 p.m. Open Discussion 
  • 8:00 p.m.​ Government Technology & Innovation Incubator: Eastern Foundry Tour, Geoff Orazem 
  • 8:30 p.m. Networking 
  • 9:00 p.m. Depart

Twenty-seventh Meetup, Monday, April 6, 2015, 6:30 p.m.

Data Science for HealthData.gov Developers & Family Caregivers

President's Chief Data Scientist and EPA Big Data Analytics Meetup, Monday, April 20, 2015, 6:30 p.m.

President's Chief Data Scientist and EPA Big Data Analytics

HHS IDEA LAB: Demand-Driven Open Data

2015 Wharton DC Innovation Summit, Federal Big Data Working Group Workshop, April 29, 2015, 1 p.m.

2015 Wharton DC Innovation Summit, CONNECT, CREATE, TRANSFORM - APRIL 28-29, 2015

Agenda Day 2: Workshop - Federal Big Data Working Group - Location 3, 1-2:30 p.m. See Logistics Below: Map and Agenda

2015 Wharton DC Innovation Summit, April 28-29: FBDWGM Workshop 4/29 1-2:30 p.m.

Katherine Goodier, Moderator, Brief Presentations and Discussion All Slides

  • Brand Niemann, Data Science for Big Data
    • Brand Niemann, former Senior Enterprise Architect & Data Scientist with the US EPA, works as a data scientist, produces data science products, and publishes data stories for Semantic Community, AOL Government, & Data Science & Data Visualization DC. He founded and co-organizes the Federal Big Data Working Group Meetup.
  • MaryAnne Sterling, Innovation, Big Data, and the Family Caregiver
    • MaryAnne is the Co-founder of Connected Health Resources and the CEO of Sterling Health IT Consulting. She has been a thought leader in health information technology and health policy for over a decade, focused on healthcare transformation through the eyes of patients and their families. She serves as Ambassador for both the Alzheimer’s Association and the Patient Centered Outcomes Research Institute (PCORI). MaryAnne also serves on the ONC Health IT Policy Committee’s Consumer Workgroup and is a former Executive in Residence for the Healthcare Information Management and Systems Society (HIMSS) Foundation, Institute for e-Health Policy. She represents the Society for Participatory Medicine on the HIMSS Patient and Family Caregiver Advisory Council. MaryAnne has been a caregiver for her aging parents for 18+ years. She is a renowned speaker and educator on family caregivers and their health information technology needs and often shares her personal experience with multiple parents struggling with dementia. Her personal story has been featured in Kiplinger, New York Times, USA Today, and the Wall Street Journal. She received her Bachelor’s Degree in Biology and Biotechnology from the University of Nebraska at Omaha.
  • RK Paleru, Right Brain Business Analytics
    • RK Paleru is a Business Intelligence and Data Analytics Executive with over 20 years of experience helping global banks, corporations, and non-profits extract business insight from data. Most recently he was the Executive Director at one of the nation's prestigious higher education institutions, where he led the Business Intelligence and Analytics initiatives for the CFO and CIO. Mr. Paleru's expertise rests at the intersection of the Business and Information Analytics domains, having worked for industry leaders such as IBM Cognos, Oracle/Citicorp, MicroStrategy, and ICICI Bank. Mr. Paleru holds an MBA from the University of Bombay and a Master's in Computer Science from the University of North Carolina.

Logistics

  • When: Tuesday, April 28, and Wednesday, April 29, 2015, 8:00am - 6:00pm. Networking Reception to follow each evening.
  • Where: FHI 360 Conference Center, 1825 Connecticut Avenue, NW, 8th Floor (Universal Building South), one block south of Washington Hilton. Off-site reception on Tuesday (location TBA) and at FHI 360 on Wednesday, 4/29.
  • Transportation: Don't call a taxi or Uber - let our transportation sponsor give you a ride in a luxury vehicle.
  • Metro: Dupont Circle / Red Line / just 4 blocks away.
  • Parking: Garage parking is located at 2005 Florida Avenue, between Connecticut Avenue and T Street and is open between the hours of 7am and 11pm.

Twenty-eighth Meetup, Monday, May 4, 2015, 6:30 p.m.

Data Science for Natural Medicines and Epigenetics

USDA Data Science MOOC Meetup, Monday, May 18, 2015, 6:30 p.m.

USDA Data Science MOOC

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides
  • 6:45 p.m. USDA Data Science MOOC
  • 7:15 p.m. Brief Member Introductions
  • 7:30 p.m. USDA ACDO and Data Stewards, and Joel Gurin, President and Founder, The Center for Open Data Enterprise (Invited)
  • 8:30 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Data Science for NIST Big Data Framework Review by May 21, 2015

Data Science for NIST Big Data Framework Review by May 21st

Wo and Nancy, Congratulations on excellent work over a long period of time!

I am asking the 700+ members of our Federal Big Data Working Group Meetup to review your DRAFT documents and provide comments.

To facilitate that process and my own comments, I am doing that at:

http://semanticommunity.info/Data_Science/Data_Science_for_NIST_Big_Data_Framework

My initial comments are at:

http://semanticommunity.info/Data_Science/Data_Science_for_NIST_Big_Data_Framework#Research_Notes

I think this will take us longer than the May 21st schedule and we plan to do a Meetup on this in July.

We are looking especially for the 6 Use Cases that have data sets, according to a recent email we saw from your participants.

Best regards, Brand

Dear Brand,

Thanks for your encouragement and special thanks for your Federal Big Data WG Meetup for willing to review and help to enhance the content of our seven NBD-PWG documents!

Yes, the 45 days public comment period is kind of short but the good news is, our future version 2 and 3 will be built on top of version 1 meaning any late comments can still be useful for our enhancement. Thanks for starting the commenting on Vol. 1 and 2, and in the process of commenting Vol. 3. I would be very appreciative if you can send in a version before May 21 so that our editing team can review and incorporate any appropriate incoming comments.

Thanks so much for your help and looking forward for your Meetup’s comments!

--Wo

Data Science for Health Datapalooza 2015, June 1-3, 2015

Data Science for Health Datapalooza 2015

As usual, we are going to do Data Science for Health Datapalooza 2015 Datalab data sets, so we hope to get more details on what will be released and discussed so that we can get a head start at:

http://semanticommunity.info/Data_Science/Data_Science_for_Health_Datapalooza_2015

which provides a more structured agenda for attendance.

Recall that our April 20th Meetup supports DJ’s Precision Medicine Initiative with our own Natural Medicine and Epigenetics Initiative, and David Portnoy's DDOD Initiative with use cases. There will be sessions on these on June 3rd and June 2nd, respectively.

Our July 20th Meetup is Data Science for ACA Data:

http://www.meetup.com/Federal-Big-Data-Working-Group/events/222369666/

co-organized by MarkLogic, HHS, Optum, and Ericsson.

For registration, please see:

http://healthdatapalooza.org/registration-3/

Metro Subway Station: Woodley Park-Zoo/Adams Morgan (Red line)  0.1 miles SE 

Twenty-ninth Meetup, Monday, June 1, 2015, 6:30 p.m.

Data Science for Homeless Data: QlikView, Tableau, & Spotfire Bakeoff

Please note that I did quick tutorial slides for Tableau since their representative had a last-minute scheduling problem. We will hopefully have them at a future date, since other BI platforms have expressed interest as well.

Data Science for USGS Minerals Big Data Meetup, Monday, July 13, 2015, 6:30 p.m.

Data Science for USGS Minerals Big Data

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides, EarthCube, Key Documents, EarthCube Data Science Publications and Key Documents
  • 6:45 p.m. Brief Member Introductions
  • 7:00 p.m. Dr. Sophia B. Liu, U.S. Geological Survey (USGS) Mendenhall Postdoc Fellow, Slides; USGS Civic Hacking Challenges on Hackpad and Slack 
  • 7:15 p.m. Brief Comments from Subject Matter Experts in the USGS Energy, Minerals, and Environmental Health Programs (invited)
  • 7:30 p.m. Brand Niemann, Data Science for USGS Minerals Big Data
  • 7:45 p.m. Open Discussion: Opportunity to show and tell proof-of-concepts, provide recommendations, and discuss opportunities and challenges with producing data science publications at USGS
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

Example Dynamically Linked Interactive Visualization: 2014 US Net Import Reliance

Government Technology & Innovation Incubator for Big Data Analytics III, June 20, 2015, 10 a.m.

Eastern Foundry to Host Challenge Cup Demo Day on Combatting PTSD

Information on Foundry Cup, Office Hours, and More! 

Join us at the Foundry Cup Demo Day on June 20th!  

The Summer 2015 Demo Day will showcase 14 finalists that are seeking to provide innovative technologies and services to detect, alleviate, and/or provide support to soldiers and veterans suffering from post-traumatic stress disorder (PTSD). The finalists will present their concepts in front of a panel of judges and live audience for the opportunity to win cash and professional development prizes. 

Please note: This event is open to the public. Beverages and concessions will be provided. There will also be free raffle drawings to win prizes such as free international round trip flights, a 40'' LED television, and more! RSVP here.

Here's what you can expect: 

  • 10:00 am: The day will kick off with check-in, networking, and a chance to enter into raffles for prizes! 
  • 10:45 am: Introductions will be made and pitches will begin. 
  • 12:30 pm: Presentations will take a break and a full-course lunch will be served. 
  • 1:30 pm: Presentations will continue. 
  • 2:30 pm: Everyone will break for refreshments and have a chance to network at the Marketplace while judges make the final decisions to determine winners of the competition. The Marketplace will have tables set up showcasing the participants' products/services. 
  • 3:30 pm: Award Ceremony

Foundry Cup is a three-day innovation competition (June 18-20) convening individuals and companies to collaborate on projects that address some of the toughest issues facing our nation’s veterans. The Foundry Cup is a bi-annual initiative and this spring's Cup is homing in on individuals and groups that provide technologies and services to detect, alleviate and/or provide support to soldiers and veterans suffering from PTSD. All 14 finalists will be competing for both cash and professional development prizes.

Don't Forget to Register for Office Hours

Eastern Foundry will be hosting guest services once a month around topics such as legal advice, business development advice, and financial advice. Office Hours is FREE for everyone - members and non-members alike. Don't miss this opportunity to take advantage of our partnerships with experts from the best firms in Washington. The 30-minute private sessions will fill up fast; Register here

Events 

Trending 40 - Federal Innovators and Entrepreneurs

Thursday, June 11 6:00pm - 8:00pm

1776 Crystal City - 2231 Crystal Drive, Suite 1000, Arlington, VA 22202  

Eastern Foundry Office Hours With Brian Lindholm - Service: Business Development  

Wednesday, June 17 1:00pm - 4:00pm

Eastern Foundry Offices - 2011 Crystal Drive, Suite 400, Arlington, VA 22202

Foundry Cup Challenge Demo Day

Saturday, June 20 10:00am - 4:00pm 

Penthouse - 2011 Crystal Drive, 10th Floor, Arlington, VA 22202

Eastern Foundry Office Hours With Barbara Morgan - Service: Accounting

Wednesday, June 24 10:00am - 1:30pm 

Eastern Foundry Offices - 2011 Crystal Drive, Suite 400, Arlington, VA 22202

Eastern Foundry Office Hours With Gordon Griffin - Service: Legal

Wednesday, July 29 9:00am - 1:30pm

Eastern Foundry Offices - 2011 Crystal Drive, Suite 400, Arlington, VA 22202

Government Procurement Related News:

Top 5 Tips for Successfully Selling Your Government Contracting Business 

For owners with an interest in selling their government contracting businesses, market conditions could not be much better.

Oh No, Other Recent Inquiries Expose More Problems With the Government's Flawed Airplane Security 

The nation buzzes with the study that the Transportation Security Administration (TSA) let through 95% of testers slipping dangerous contraband through passenger screening.

US Federal Contractor Registration: Apply for the DAPA (Distribution and Pricing Agreement) Registration in 2015

US Federal Contractor Registration is encouraging businesses to become registered in the DAPA Registration in order to sell pharmaceutical and medical products to contracting officers at DLA Troop Support.

Government Procurement Related Events:

Techonomy Policy 
The program is a big-picture conversation about the forces altering the fundamental relationship between tech and government. Speakers include Senator Cory Booker, Steve Case, Vint Cerf, FCC Commissioners O’Rielly and Rosenworcel and Sean Parker.

National SBIR/STTR Conference 
This is one of the largest events in the world focused on innovation-driven, high-growth and STEM-focused inventive small businesses and entrepreneurs.

TechVets Meetup 
Connect with other veteran entrepreneurs working in the tech field. 

A Note From the Eastern Foundry Team

We're really excited to create an avenue that brings together like-minded professionals in a collaborative atmosphere. Our journey is just getting started and we'd love to have you along for the ride.

We plan on keeping you updated with EF related news along with content on government contracting topics such as RFPs that have recently dropped, new technologies in the government, and changes to the FAR. You can expect updates from us every Monday morning starting next week. We couldn't do this without you so let us know how we're doing; we love your feedback! Connect with us - We have exciting news coming up.

Thirtieth Meetup, June 29, 2015, 6:30 p.m.

Data Science for Cyber Physical Systems-Internet of Things

Special Meetup at the NSF, July 15, 2015, 1 p.m.

See Meetup.com: Data Science for RDA Climate Change Data Challenge

See Semantic Community: Data Science for RDA Climate Change Data Challenge

Thirty-first Meetup, July 20, 2015, 6:30 p.m.

Data Science for ACA Data & Semantic Medline Precision Medicine

This Meetup is being organized by Mary Galvin & Emily Minnich (MarkLogic), Chris Thompson (HHS-CMS, but no endorsement implied), Lloyd Brodsky (Optum), Joseph Siryani (Ericsson.com), and Brand Niemann (Semantic Community)

  • 6:30 p.m. A Tale of Federal Marketplaces, Kash Badami, MarkLogic, Slides
  • 7:00 p.m. ACA Data Team Slides
    • A story of HIX
    • An overview of the Federal HIX and DSH
    • The Past: What went wrong and how we fixed it
    • 2014: A remarkable success
    • What does the future hold for HIX and insurance nationwide?
  • 7:15 p.m. Brief Member Introductions
  • 7:30 p.m. Data Science for Affordable Care Act Data, Slides
  • 7:45 p.m. Semantic Medline Precision Medicine, Dr. Sreenivas Rangan Sukumar, Research Scientist, Health Data Sciences Initiative and Computational Sciences and Engineering Division, Oak Ridge National Laboratory. Slides. See: Data Science for Health Datapalooza 2015
  • 8:30 p.m. Open Discussion
  • 8:45 p.m.  Networking
  • 9:00 p.m. Depart

Data Science for Agency Initiatives 2015, August 3, 2015, 6:30 p.m.

Data Science for Agency Initiatives 2015

Listen to CFPB Data Manager, get Consumer Complaint Database, and see Data Science on that data set!

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for Agency Initiatives 2015
  • 7:15 p.m. Brief Member Introductions
  • 7:30 p.m. Chad Tompkins, Section Chief, Data Section, Office of Consumer Response (suggested by Linda F. Powell, Chief Data Officer, Consumer Financial Protection Bureau), Consumer Complaint Database, Slides (not cleared for public release)
  • 8:15 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

A NIH – Semantic Medline Data Science Data Publication Commons, August 17, 2015, 6:30 p.m.

A NIH – Semantic Medline Data Science Data Publication Commons

Listen to the NIH Semantic Medline Developer, Get Semantic Medline Database, and See Data Science Data Publication on NIH/NLM Data Sets and Learn About the NIH Commons! 

The Yosemite Project for Healthcare Information Interoperability, August 31, 2015, 6:30 p.m.

The Yosemite Project for Healthcare Information Interoperability

Hear David Booth talk about The Yosemite Project, an ambitious roadmap for achieving semantic interoperability of all structured healthcare information, and hear Professor Barry Smith talk about his new book, Building Ontologies with Basic Formal Ontology.

TITLE: The Yosemite Project for Healthcare Information Interoperability

SPEAKER: David Booth, PhD, HRG and Rancho BioSciences

ABSTRACT: 
Healthcare data originates in a wide variety of data formats, data models and vocabularies, making information interoperability a major challenge. Although many standards exist, and each one strives for a level of interoperability within its scope, in the aggregate these standards form an uncoordinated patchwork that thwarts interoperability.

Furthermore, even when standards are used, translation between data formats, data models and vocabularies is still needed, for a variety of reasons.

The Yosemite Project describes an ambitious roadmap for achieving semantic interoperability of all structured healthcare information.

Based on RDF as the best available candidate for a universal information representation, this roadmap addresses both the need to 'standardize the standards' and the opportunity to crowdsource data translations that are still required for information interoperability.
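
As a rough illustration of what a universal representation buys, the sketch below maps two differently shaped records into subject-predicate-object triples, the data model that RDF is built on. The record layouts, field names, and the `ex:` vocabulary are hypothetical and are not taken from the Yosemite Project; a real pipeline would use an RDF library and standard healthcare vocabularies.

```python
# Hypothetical records from two source systems that name the same facts differently.
record_a = {"patient_id": "p1", "systolic_bp": 120}
record_b = {"id": "p2", "bp": {"systolic": 135}}


def triples_from_a(rec):
    """Translate system A's flat layout into (subject, predicate, object) triples."""
    subject = f"ex:patient/{rec['patient_id']}"
    return {(subject, "ex:systolicBP", rec["systolic_bp"])}


def triples_from_b(rec):
    """Translate system B's nested layout into the same hypothetical vocabulary."""
    subject = f"ex:patient/{rec['id']}"
    return {(subject, "ex:systolicBP", rec["bp"]["systolic"])}


def merged_graph():
    """Union the per-source triples; a shared vocabulary makes merging a set union."""
    return triples_from_a(record_a) | triples_from_b(record_b)


def systolic_readings(graph):
    """Query the merged graph without knowing which system a triple came from."""
    return sorted(obj for (_, pred, obj) in graph if pred == "ex:systolicBP")


if __name__ == "__main__":
    for triple in sorted(merged_graph()):
        print(triple)
```

Each source system needs only one translation into the shared model, rather than one translation per pair of systems, which is the kind of translation work the abstract suggests could be crowdsourced.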

BIO: 
David Booth is an independent consultant and senior software architect at HRG and at Rancho BioSciences, using Semantic Web technology to make healthcare and biomedical data interoperable between diverse systems. He previously worked at KnowMED, using Semantic Web technology for healthcare quality-of-care and clinical outcomes measurement, and at PanGenX, applying Semantic Web technology to genomics in support of personalized medicine. Before that he worked on Cleveland Clinic's SemanticDB project, which uses RDF and other semantic technologies to perform cardiovascular research. Prior to that he was a software architect at HP Software, where his primary focus was emerging technologies. He was a W3C Fellow from 2002 to 2005, where he worked on Web Services standards before becoming involved in Semantic Web technology. He has been programming for many years using a variety of programming languages and operating systems. He holds a PhD in Computer Science from UCLA.

For information on other webinars in the Yosemite Project series, visit http://YosemiteProject.org.

Building Ontologies with Basic Formal Ontology By Robert Arp, Barry Smith and Andrew D. Spear

In the era of “big data,” science is increasingly information driven, and the potential for computers to store, manage, and integrate massive amounts of data has given rise to such new disciplinary fields as biomedical informatics. Applied ontology offers a strategy for the organization of scientific information in computer-tractable form, drawing on concepts not only from computer and information science but also from linguistics, logic, and philosophy. This book provides an introduction to the field of applied ontology that is of particular relevance to biomedicine, covering theoretical components of ontologies, best practices for ontology design, and examples of biomedical ontologies in use.

After defining an ontology as a representation of the types of entities in a given domain, the book distinguishes between different kinds of ontologies and taxonomies, and shows how applied ontology draws on more traditional ideas from metaphysics. It presents the core features of the Basic Formal Ontology (BFO), now used by over one hundred ontology projects around the world, and offers examples of domain ontologies that utilize BFO. The book also describes Web Ontology Language (OWL), a common framework for Semantic Web technologies. Throughout, the book provides concrete recommendations for the design and construction of domain ontologies.

Big Data Science for Precision Farming Business, September 14, 2015, 6:30 p.m.

USDA Big Data Science for Precision Farming Business

USDA Collaboration Chronology (see below), Preview of Online Course, and Commercial Examples: Farmers Business Network, FarmLogs, etc.

USDA Collaboration Chronology 

  • March 16th: USDA CIO and ACDO on Open Data Plan and Roundtable Meetup
  • March 25th: Government Technology & Innovation Incubator for Big Data Analytics II Meetup at Eastern Foundry
  • May 18th: USDA Data Science MOOC Meetup
  • May 21st: USDA Open Data Quarterly Submission to OMB on USDA Data Usage provided (USDA Data Science MOOC)
  • July 21st: Data-Driven Farming Online Course Announced by HeatSpring and Semantic Community
  • July 27th: USDA Microsoft Innovation Challenge Submission on Farm Data Dashboards
  • July 29th: Partnerships Sought for Data-Driven Farming Online Course
  • September 14th: Big Data Science for Precision Farming Business Online Course Meetup and Commercial Examples: Farmers Business Network, FarmLogs, etc.
  • October 26-December 18th: Data-Driven Farming Online Course with Partners

Climate Change & Genomic Data - Data Science Meetup of Meetups, September 28, 2015, 6:30 p.m.

Climate Change & Genomic Data - Data Science Meetup of Meetups

Preview of the Upcoming OSTP/NSF Data Science Meetup of Meetups on November 5-6, Curated Data Sets for Climate Change and Genomic Data Science, and NIH/NLM/NCBI Genomic Data

Data Science for EPA & USGS Fracturing & Fracking Data, October 5, 2015, 6:30 p.m.

Data Science for EPA & USGS Fracturing & Fracking Data

Get a Preview of National Data Science Organizers Workshop on November 5-6, 2015, and the Focus on National Data Science Challenges and Hackathons

Data Science for EPA's Chief Data Scientist: Big Data for Nutrients and Air Quality, October 19, 2015, 6:30 p.m.

Data Science for EPA's Chief Data Scientist: Big Data for Nutrients & Air Quality

Get Another Preview of National Data Science Organizers Workshop on November 5-6, 2015, and the Focus on National Data Science Challenges and Hackathons

Please Note:

This Meetup was organized for the National Data Science Organizers Workshop on November 5-6, 2015, as an example of:

  • data science for curated data sets,
  • user-centric digital services focused on the interaction between government and the people and businesses it serves, and
  • a Federal Community of Practice on Crowdsourcing and Citizen Science of Big Data that meets bi-monthly to share lessons learned and develop best practices for designing, implementing, and evaluating crowdsourcing and citizen science initiatives.

Data Science for Random Forests: TIBCO Enterprise Runtime for R, November 2, 2015, 6:30 p.m.

Data Science for Random Forests: TIBCO Enterprise Runtime for R

Get A Final Preview of National Data Science Organizers Workshop on November 5-6, 2015, and the Focus on National Data Science Challenges and Hackathons. Also Learn About Two Popular Tools (Random Forests and R) That Are Widely Used

Special Data Science Meetup of Data Science Meetups, November 5-6, 2015

See Photos

Special Data Science Meetup of Data Science Meetups

http://www.nationalprioritychallenge.org/

Potential Sessions for Meetup Meeting: November 5-6, 2015 (Please note: This will be in-person by invitation and remote for all)

National Data Science Organizers Workshop

Potential Sessions for November 5-6, 2015

Day 1 - November 5th, 2015 (Half Day)

12:00 pm (Pre-conference Lunch with Big Data Regional Hubs Leaders) 

1:30 pm Session 1: Data Science for the Nation

Keynote: What are the National Priorities?, White House Office of Science and Technology Policy - Deputy Director for Technology and Innovation

Impacts of Data Science on National Priorities

• Data Kind: Speaker

• Data Science for Social Good: Speaker

• Federal Meetup: Speaker

Discussion: Using Meetups to explore National Challenges

5:00 pm: Evening Event at AAAS: Grassroots Data Science Across the Nation

• Lightning Talks: Every group gets 10 slides and 3 minutes

• Highlight past events in National Priority Areas or of national interest, state plans for the future, and give challenges, and ideas for how a Network of Data Science Organizers can solve national problems.

• Networking Reception: Highlight AAAS, S&T Fellows, and Affinity Groups

Day 2 - November 6th, 2015 (Full Day)

8:00 am Session 2: Exposing Data

Available Datasets: Speakers

• Socrata Open Data Portal demo: Speaker

• Open Data.gov / Open Data Working Group: Speaker

Exposing data resources

• Meetup Contributions

Product Creation: Connecting data sources among regions.

10:00 am Break

10:30 am Session 3: Coordination and Support of Data Science Meetups

Resources for Meetups:

• Federal Support for Meetup groups: Chaitan Baru, Senior Advisor for Data Science, NSF

Coordination mechanisms:

• YouTube Channel, Podcast, White Papers, listserv

• Meetup of Data Science Meetup groups Online

Discussion: Mechanisms to spread good ideas among regions.

12:30 pm Lunch Speaker

1:30 pm Session 4: The National Priority Challenge

• National Priority Challenge-Speaker

• National Data Science Challenges and Hackathons: Proposed by steering committee

• Research Data Alliance (RDA): P8 venue to announce specific challenge (2016)

Working Session: Launching National Priority Challenge 2016

5:45 pm Closing Remarks: TBA

Background: 

This is what I have been working on and telling you was coming: Data Science Data Curation for Sustainable Data Science Meetups of Meetups

I just finished four data science data ecosystems:

• RDA Climate Data Challenge (July 15): 
http://semanticommunity.info/Data_Science/Data_Science_for_RDA_Climate_Change_Data_Challenge 

• RDA Information Week 2016 (Ebola Response and Nepal Earthquake) (July 17): 
http://semanticommunity.info/Data_Science/Data_Science_for_Global_Ebola_Response_Data 

• USDA Microsoft Innovation Challenge (July 27): 
http://semanticommunity.info/Data_Science/Big_Data_Science_for_Precision_Farming_Business#Story 

• US Data Act (July 28): 
http://semanticommunity.info/Data_Science/Data_Science_for_the_DataAct_Datathon

Please work with one or more of these data science data ecosystems and post a link to your results so we can decide who to invite to present.

Update: We have now added Genomic Data. See NSF Data Science Workshop 2015 for September 28th Meetup

Please follow our Data Mining - Data Science – Data Publication Process

Data Mining Process: 
• Business Understanding 
• Data Understanding 
• Data Preparation 
• Modeling 
• Evaluation 
• Deployment 

Data Science Process: 
• Data Preparation 
• Data Ecosystem 
• Data Story

Data Science Questions: 
• How was the data collected? 
• Where is the data stored? 
• What are the data results? and 
• Why should we believe the data results? 

Data Science Data Publication: 
• Knowledge Base 
• Spreadsheet Index 
• Web & PDF Tables to Spreadsheet 
• Data Browser 
• Dynamically Linked Adjacent Visualizations

The above links provide examples of implementing these.
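
As one hedged illustration of the "Web & PDF Tables to Spreadsheet" step, the sketch below turns a simple HTML table into CSV text that a spreadsheet can open. It uses only the Python standard library; the class name, function name, and sample table are made up for this example, and it assumes a single un-nested table without merged cells. It is not the tooling used in the linked examples.

```python
import csv
import io
from html.parser import HTMLParser


class TableToRows(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def html_table_to_csv(html):
    """Return the table rows found in `html` as CSV text."""
    parser = TableToRows()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()


# Made-up sample input, for illustration only.
SAMPLE = (
    "<table><tr><th>Item</th><th>Value</th></tr>"
    "<tr><td>Example A</td><td>10</td></tr></table>"
)

if __name__ == "__main__":
    print(html_table_to_csv(SAMPLE), end="")
```

Once the table is in CSV form, it can be loaded into a spreadsheet and linked from a spreadsheet index or data browser.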

Data Science for the DataAct Datathon, To Be Rescheduled

Data Science for the DataAct Datathon

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for the DataAct Datathon 
  • 7:00 p.m. Brief Member Introductions 
  • 7:15 p.m. Team member Presentations: Allyson Ugarte, Steve Hanmer, Chris Crawford, and Peter Morosoff
  • 8:15 p.m. Open Discussion 
  • 8:45 p.m. Networking 
  • 9:00 p.m. Depart

3rd Annual Big Data for Intelligence Symposium, Nov. 17-18, 2015

3rd Annual Big Data for Intelligence Symposium, Nov. 17-18, 2015

Harnessing the Power of Big Data for The Intelligence Community
Alexandria, VA
http://bigdatasymposium.dsigroup.org/

Presentation: National Priorities for Big Data

Dr. Brand Niemann
Director and Senior Data Scientist/Data Journalist
Semantic Community
http://semanticommunity.info
Founder and Co-Organizer
Federal Big Data Working Group Meetup
http://www.meetup.com/Federal-Big-Data-Working-Group/

The White House Office of Science and Technology Policy (OSTP) and the National Science Foundation (NSF) convened a National Data Science Organizers Workshop, November 5-6, 2015, to discuss: (1) Data Science for the Nation: National Priorities, Impacts of Big Data Science on National Priorities, and Using Meetups to Explore National Challenges; (2) Exposing Data; (3) Coordination and Support of Data Science Meetups; and (4) The National Priority Challenge.

The results of this workshop will be summarized along with highlights from the Federal Big Data Working Group Meetup, for which the presenter is the Founder and Co-Organizer.

Examples will be given of what the Federal Big Data Working Group Meetup has done from 2014 to the present to provide big data science tutorials and Massive Open Online Courses (MOOCs), curated government datasets, and citizen science and crowdsourcing in support of the White House forum Open Science and Innovation: Of the People, By the People, For the People, as part of the President's 2013 Second Open Government National Action Plan.

Note Part of the Abstract: In the 2013 Second Open Government National Action Plan, President Obama called on agencies to harness the ingenuity of the public by accelerating and scaling the use of open innovation methods such as citizen science and crowdsourcing. This forum brings together citizen science professionals, researchers, and stakeholders from local, state, Federal, and Tribal governments; academia; non-profits; and the private sector to celebrate the contributions of crowdsourcing and citizen science to enhancing agencies’ missions, and scientific and societal outcomes. #WHCitSci

https://www.whitehouse.gov/live/open...-people-people

DRAFT AGENDA FOR REVIEW ONLY. INVITED SESSIONS ARE SUBJECT TO CHANGE

Tuesday November 17, 2015

8:00 -  8:45

Registration and Light Breakfast Reception Open

8:45- 9:00

Chairman’s Opening Remarks

9:00- 9:45

Opening Keynote Remarks:

Big Data and the Need for Information Environments

-Update on the IC’s migration to IC-ITE

-Taking advantage of cloud computing and the necessary security enhancements

-Utilizing predictive analytics in support of information security and enhanced intelligence

-Monitoring where sensitive data is and who has access to it on a real-time basis

Dr. Raymond Cook (Invited) Assistant DNI and IC CIO ODNI

9:45 – 10:30

US Army Priorities for Utilizing Big Data in Support of an Enhanced Intelligence Enterprise

-Integrating critical multi-discipline intel capabilities in all layers to support regionally aligned Army

-Maturing The US Army’s ability to leverage the national to tactical enterprise ISO expeditionary/distributed ops

-Setting conditions to ensure Army’s alignment with evolving IC ITE, DoD JIE, Army MC COE requirements

Annette Redmond, SES (Confirmed)

Director, Intelligence Community Information Management Office of the Deputy Chief of Staff, HQDA G-2

10:30 – 11:15

Networking Break

11:15-12:00

Development of Technology and Tools to Maximize Insight from Large Unstructured Data Sets

-State-of-the-art core technologies needed to collect, store, preserve, manage, analyze, and share BIG DATA that could benefit from standardization

-Potential measurements to ensure the accuracy and robustness of methods that harness these technologies

Dr. Ashit Talukder (Confirmed)

Division Chief, Information Access Division NIST

12:00- 1:15

Networking Lunch

1:15 – 2:00

Utilizing Big Data to Enhance the USMC Intelligence Enterprise

-MCISRE Enterprise capabilities

-Connecting users to product development

BGen Michael Groen, USMC (Invited)

Director

HQMC Intelligence Department

2:00 – 2:45

Utilizing Big Data and Predictive Analytics to Enhance Enterprise Effectiveness

COL Bobby Saxon, USA (Invited)

Division Chief and Program Director HQDA G-3/5/7

2:45– 3:15

Networking Break

3:15 - 4:00

Integrating Future Operations: ISR in the Combat Cloud

  • Integrating the Intelligence Community with the Warfighter
  • Empowering analytics and innovation
  • Automating sensor networks and  battlespace networking

Jeffrey Eggers, SES (Confirmed)

Chief Technology Officer

Deputy Chief of Staff for ISR, HQAF

4:00 – 4:45

National Priorities for Big Data

-Data science for the nation: Impacts of big data science on National priorities

-Data science for tackling the challenges of big data

-Developing people, processes, and products for the Federal Government

Dr. Brand Niemann (Confirmed) Founder and Co-Organizer Federal Big Data Working Group Slides

4:45

End of Day 1

 

Wednesday, November 18, 2015

8:15 - 8:45

Registration and Light Breakfast Reception Open

8:45 - 9:00

Chairman’s Opening Remarks

9:00 - 9:45

Utilizing Big Data to Enhance Intelligence Value

Mike Bender (Confirmed)

Director, Laboratory of Analytic Sciences North Carolina State University

9:45 - 10:30

Keynote Remarks:

Utilizing the Influx of Big Data to Enhance Geospatial Intelligence

-Empowering Geospatial Intelligence production with Big Data Analytics

-Crowdsourced mapping for Geospatial Big Data Analytics

-Speeding up the acquisition process to transition critical technology into the NGA enterprise

Susan Gordon (Invited)

Deputy Director NGA

10:30 - 11:00

Networking break

11:00-11:45

Initiatives at NRO to Maximize Timely Intelligence Production through Advances in Big Data Analytics

-Reducing time for detection, collection, processing & decision-making

-Utilizing advanced analytics to attack hard problems

-Big Data as part of the “Multi-INT” solution

Terry Duncan (Invited)

Director, Communications Systems Directorate NRO

11:45 - 12:30

Innovative Analytics Efforts to Gain Actionable Intelligence from Big Data

-Equipping the analytic enterprise with the tools, skills and personnel needed to harvest transformative insights from Big Data

-Creating new technology to enhance analytic efficiency and apply data science methods

Catherine Johnston (Confirmed)

Director for IC ITE and Digital Transformation Defense Intelligence Agency

12:30 – 1:30

Networking Lunch

1:30 – 2:15

Utilizing Big Data to Inform DoD Acquisition

  • Data-driven decisions from the program office, to Department decision makers, to Capitol Hill
  • Data stewardship, access, and analysis
  • The Defense Acquisition Visibility Environment

Mark Krzysko (Confirmed)

Deputy Director, Enterprise Information OUSD (AT&L)

2:15 – 3:00

Empowering the US Navy Intelligence Core Analytic Enterprise with Big Data

-Harnessing data for the warfighter

-All source analytics for big data

-US Navy requirements for industry

B. Lynn Wright (To Be Invited) Deputy Director of Naval Intelligence DCNO N2/N6

3:00

End of Symposium

 

Data Science and Semantic Insights for DoD Joint Doctrine, December 7, 2015, 6:30 p.m.

Data Science and Semantic Insights for DoD Joint Doctrine

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for Joint Doctrine 
  • 6:45 p.m. Brief Member Introductions 
  • 7:00 p.m. Chuck Rehberg, CTO, and Dave Decker, Semantic Insights, Slides; Steve Hambry, VP Product Development, Securboration, Slides; and Barry Smith, Distinguished Professor and Director, National Center for Ontological Research, SUNY Buffalo, Slides
  • 8:30 p.m. Open Discussion 
  • 8:45 p.m. Networking 
  • 9:00 p.m. Depart

Data Science for Semantics: MarkLogic and Cray Graph Appliance Update, January 4, 2016, 6:30 p.m. TO BE RESCHEDULED

Data Science for Semantics: MarkLogic and Cray Graph Appliance Update

Data Science for American Community Survey, February 1, 2016, 6:30 p.m.

Data Science for American Community Survey

  • 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for American Community Survey See example of Census Visualization Gallery below
  • 7:15 p.m. Brief Member Introductions
  • 7:30 p.m. Invited Presentation, David Raglin, Chief, ACS Survey Analytics and Measurement Branch, Census Bureau. See: Introduction to the American Community Survey. Description: Discover the detailed social, economic, and housing statistics that the American Community Survey (ACS) provides for every community every year. You will learn about the basics of the yearly estimates and datasets produced from the ACS, resources available on our website (http://census.gov/acs), and how to access ACS data products through a variety of tools, with emphasis on American FactFinder. Date: August 25, 2015, Time: 2pm-3pm ET, Level: Introductory
  • 8:15 p.m. Open Discussion
  • 8:45 p.m. Networking
  • 9:00 p.m. Depart

128th CNSTAT Meeting Public Seminar
"Reengineering the 2020 Census"
Friday, October 23, 2015 

National Academy of Sciences Building
2101 Constitution Avenue NW
Washington, DC 20418
Lecture Room
Download the agenda (My Note: See below)
Register now!

The 2020 Census will be like no decennial census before it. The Census Bureau is reengineering key processes to make the 2020 count the most technologically advanced decennial census ever taken—including innovative techniques and technology as well as new data sources—while still producing quality results. Beginning early in this decade, the Census Bureau’s research and testing has focused on those innovations that showed the most promise of significant cost avoidance, with the goal of using the results to outline the general shape of the 2020 Census by the end of FY 2015. Those decisions have now been made and documented in the 2020 Census Operational Plan for release in early October. The Census Bureau speakers will summarize the design decisions and the associated estimates of cost savings. They also will cover the research and testing the Census Bureau has done in four major innovation areas, with particular focus on results from four major tests conducted in 2015: 

  • The Human-in-the-Loop simulation test of many aspects of field reengineering;
  • The Address Validation Test of using aerial imagery and other external sources to update the Master Address File (MAF);
  • The 2015 Optimizing Self-Response Test of ways to use advertising, partnerships, and mail contact strategies to maximize self-responses (particularly via the Internet) and the feasibility and benefits of allowing people to respond without a Census Bureau-supplied identification number; and
  • The 2015 Census Test of reengineered roles, responsibilities, and infrastructure for field operations, increased field automation, and use of administrative records and adaptive design to reduce and streamline personal visits in the field.

Discussants Joseph Salvo and Donald Dillman will comment, among other topics, on the Census Bureau’s plans to revise nonresponse follow-up strategies (and their effects at local levels) and to maximize Internet take-up in initial response.

Agenda
1:30 PM - Light Refreshments for Seminar Guests (First Floor East Court)
2:00 PM - Welcome, Lawrence Brown, CNSTAT Chair and the University of Pennsylvania
2:05 PM - Developments at the OMB Statistical and Science Policy Office, Katherine Wallman, Chief Statistician, U.S. Office of Management and Budget
2:15 PM - Tribute to Janet Norwood, Constance Citro, Director, CNSTAT
2:35 PM - Featured Topic: Reengineering the 2020 Census,  John Thompson, Director, U.S. Census Bureau, and Lisa Blumerman, Associate Director for Decennial Census Programs
3:30 PM - Discussion, Joseph Salvo, New York City Department of City Planning, and Donald Dillman, CNSTAT Member and the University of Washington
4:00 PM - Floor Discussion
4:30 PM - Reception (East Court)
5:30 PM - Adjourn

Presentations will be available via WebEx; login information will be sent once available.
For any questions, contact Eileen LeFurgy, CNSTAT Program Coordinator, at elefurgy@nas.edu or (202) 334-1616.


Possible Team Presentations

Subcommittee on Networking and Information Technology Research and Development

(NITRD Subcommittee)

Source: Web Address

Group / Sub-groups / Data Science Team (1) / Comments

Interagency Working Groups (IWG)
  • CSIA - Cyber Security and Information Assurance: Big Data Analytics Approach to Cyber Security
  • HEC - High End Computing: Semantic Medline

Coordinating Groups (CGs)
  • HCI&IM - Human Computer Interaction and Information Management
  • HCSS - High Confidence Software and Systems
  • LSN - Large Scale Networking
  • JET - Joint Engineering Team
  • MAGIC - Middleware And Grid Interagency Coordination Team
  • SDP - Software Design and Productivity
  • SEW - Social, Economic, and Workforce Implications of IT

Senior Steering Groups (SSGs)
  • BD - Big Data: Semantic Medline; Big Data Analytics Approach to Cyber Security
  • CPS - Cyber Physical Systems
  • CSIA R&D - Cyber Security and Information Assurance Research and Development: Big Data Analytics Approach to Cyber Security
  • Health IT R&D - Health Information Technology Research and Development: Semantic Medline
  • HITIDE - Health Information Technology Innovation and Development Environments Subgroup: Semantic Medline
  • WSRD - Wireless Spectrum Research and Development

Community of Practice (CoP)
  • FASTER - Faster Administration of Science and Technology Education and Research

(1) Where did you get the data, where did you store the data, and what were your results?

Co-organizers

Short Bio: Brand Niemann, former Senior Enterprise Architect & Data Scientist with the US EPA, works as a data scientist, produces data science products, and publishes data stories for Semantic Community, AOL Government, & Data Science & Data Visualization DC

Participation in other Meetups: Data Science DC, Data Visualization DC, Graph Database, Semantic Web, Etc.

Do you live near a DC Metro Station and use Skype?: I live near the Vienna Station and I use Skype.

Short Bio: Katherine Goodier is a technical architect for Xcelerate Solutions. She has worked with big data as a defense enterprise architect, system engineer, and intelligence analyst and ontologist for the last 20 years. Prior to this, she was a network provisioning engineer and quality director for AT&T, Nortel, and DSC.

Participation in other Meetups: Yes

Do you live near a DC Metro Station and use Skype? Yes

Xcelerate Solutions Office Looking South to the Spring Hill Road Silver Line Metro Station


Xcelerate Solutions Conference Room


Join.me Help

https://join.me/

You can create a recording of your join.me pro or enterprise meeting in a few easy steps.

Important: join.me pro or enterprise is required – paid or trial.

  1. Start join.me on the main computer and log in
  2. Set the microphone volume
  3. Share the screen
  4. Start join.me on another computer as a participant (must have at least one)
  5. Dial in
  6. Start recording
  7. Stop recording

 

Watch this video to get started, or follow these steps:

  1. While logged in to your join.me pro or enterprise account, start a meeting.
  2. Have at least one participant join the meeting.
  3. Start recording: Meeting tools > Record.

  4. Stop recording: Meeting tools > Stop Recording. You'll see a notification: We are processing your recording.
  5. To view your recording:
    • Option 1: Click the my meetings link in the notification
    • Option 2: Open join.me in your browser, log in, and go to my meetings > recordings.
What gets recorded?
Your recording includes the shared part of your screen and any conference audio. Sounds played on your PC/Mac and join.me video are not recorded.
How long does it take for recordings to process?
Most join.me recordings will arrive within minutes. However, there are a few reasons why your recording may take longer to appear.
  • When your meeting has a bad or choppy internet connection, it may take up to 20 minutes to process the recording.
  • When audio participants fail to hang up, the recording starts to process after a 24-hour delay.
Security Notes
  • Recording can only be turned on by the meeting organizer
  • Meeting participants are always notified when a meeting is being recorded, including audio-only participants
  • If recording is already in progress when a participant joins, the participant is notified that the meeting is being recorded
