Minutes of NBD-PWG/Subgroups Joint Meeting on August 21, 2013

Wo Chang opened the meeting and presented an overview presentation on subgroup charters, schedules, processes, and preliminary deliverables. Please see http://bigdatawg.nist.gov/_uploadfiles/M0150_v1_7800818675.pdf for presentation material.

Subgroup Reports:

Nancy Grady represented the Definition and Taxonomy subgroup. She emphasized their focus was on new “Big Data” definitions rather trying to define all data related terms. She mentioned that horizontal scaling needed new engineering and process ordering changes. The subgroup has defined “data science” and “data scientist”. There is still work to do on metrics. Please refer document under M0142.

Geoffrey Fox represented the Requirements Subgroup. He showed Use Case Template and Use Cases collected (18 of them) by the subgroup (see M0105). Use cases submitted before Aug. 25 will be examined for requirements. He suggested that posting the Use Cases on a Web site would be a valuable information resource. Geoffrey showed the requirements extracted by Wo (M0125) from the Use Cases and mapped to 7 Categories in the Reference Architecture. The Requirements on the 3 Abstractions in the Reference Architecture are still to be determined. The Requirements deliverable will include a summary for each Use Case with the Application Description, Current Approach, and Future Approach being explored. The summary will be available by the end of the week.

Wo presented the Security and Privacy subgroup summary because none of the Co-chairs was able to attend. He showed their working requirements document that is available on Google Docs at https://docs.google.com/document/d/1oahT1sTwb7DoCeY0BGwMQy7aUF0iFAR6_rIqMif9m9A/edit?pli=1 There are 4 key areas; Infrastructure, Data Privacy, Data Management, and Integrity and Reactive Security. There are many examples of Use Cases that will be mapped to the Reference Architecture.

Orit Levin represented the Reference Architecture subgroup. She showed the Reference Architecture starting point and current diagram (see M0100). She explained some of the components including the new abstraction interfaces. The exact division between scalable and legacy components is still under discussion. Wo noted that the diagram is still evolving. Orit mentioned that new viewpoints were being considered (e.g. business perspective). The two deliverables will be a White Paper describing different Reference Architecture and Common Reference Architecture description. She described dependencies from other subgroups (e.g. the Requirements subgroup for requirements, the Def. & Tax. subgroup for definitions and taxonomies, the Roadmap subgroup for business perspective).

Carl Buffington represented the Technology Roadmap subgroup. He showed outline for deliverables (see M0087). The Roadmap subgroup needs input from other subgroups. Big Data standards are being defined by Keith Hare. Capabilities, technology readiness, maturity models, and gap analysis are being created. Big data strategies will be part of the deliverable including adoption, implementation and resourcing strategies. There will also be a concerns and assumption section at the end of the deliverable.

Final Discussions

Bob Marcus suggested that a User Guidance and User FAQ be added to the subgroup deliverables to enable users to better understand the contents. Examples are Guidance (M0149) and FAQ (M0079). This suggestion was positively received with no objections.

Outlines

The draft outlines for all the subgroups were presented and described in detail. There were no questions or objections raised about the structure.

September Meeting

Wo invited all Big Data Working Group members to attend the face-to-face meeting on September 30 at NIST headquarters in Gaithersburg, Maryland. Registration is at https://www-s.nist.gov/CRS/conf_disclosure.cfm?&conf_id=6552.

Chat Log

(12:42 PM) Brenda Kirkpatrick joined.

(12:57 PM) William Vorhies (Predictive Modeling, LLC.) joined.

(12:57 PM) Brand Niemann joined.

(12:58 PM) Keith Hare (JCC Consulting, Inc.) joined.

(12:58 PM) Steve Cotton joined.

(12:58 PM) Geoffrey Fox joined.

(12:59 PM) Bob Marcus (ET-Strategies) joined.

(12:59 PM) David Chesnut joined.

(1:00 PM) David Bruggeman joined.

(1:00 PM) paul savitz joined.

(1:00 PM) Sanjay Mishra_(Verizon) joined.

(1:01 PM) Luca Lepori joined.

(1:01 PM) Dr. Hatem Sleem joined.

(1:01 PM) Charlie joined.

(1:01 PM) Sanjay Mishra_(Verizon): Wo, is the Voice bridge open?

(1:01 PM) Stephanie Lokmer joined.

(1:02 PM) Dr. Hatem Sleem disconnected.

(1:02 PM) Ben Kobler joined.

(1:02 PM) Hatem Sleem joined.

(1:02 PM) Sanjay Mishra_(Verizon): nm, it is open now

(1:02 PM) Wo Chang (NIST): yes, the voice bridge will open later

(1:02 PM) Dan McClary (Oracle) joined.

(1:02 PM) Chaitan Baru (SDSC) joined.

(1:03 PM) Wo Chang (NIST): I meant the VOIP will open after I talked; the audio bridge should be open with *6

(1:03 PM) Nancy Grady (SAIC) joined.

(1:04 PM) Vivek Navale joined.

(1:04 PM) Dan McClary (Oracle): Wo, I'm not hearing anything on the audio bridge

(1:04 PM) Dan McClary (Oracle): it's just static

(1:04 PM) Vivek Navale: yes

(1:04 PM) Luca Lepori: yeas

(1:04 PM) Steve Cotton: yes

(1:04 PM) Carl Buffington (Vistronix) joined.

(1:05 PM) Steve Cotton: need slide show mode to see full slide

(1:06 PM) Pw Carey, (Compliance Partners, LLC) joined.

(1:06 PM) Orit Levin (Microsoft) joined.

(1:06 PM) Dave Raddatz (SGI) joined.

(1:06 PM) Steve Cotton: and maximize window, please

(1:07 PM) Nancy Grady (SAIC) disconnected.

(1:07 PM) Nancy Grady (SAIC) joined.

(1:07 PM) MITRE joined.

(1:09 PM) Charlie disconnected.

(1:09 PM) Carl Buffington (Vistronix) disconnected.

(1:10 PM) Yuri Demchenko (UvA) joined.

(1:10 PM) Brand Niemann: Brand NIemann joined

(1:11 PM) Charlie joined.

(1:11 PM) William Miller (MaCT USA) joined.

(1:12 PM) Karen G joined.

(1:14 PM) Marcia Mangold joined.

(1:15 PM) PavithraKenjige (PK Technologies) joined.

(1:16 PM) Joules Technology Inc. joined.

(1:17 PM) Tim Zimmerlin (Automation Technologies) joined.

(1:17 PM) Steve Cotton: Please maximize and use slide show--having trouble reading details.

(1:18 PM) PavithraKenjige (PK Technologies) disconnected.

(1:19 PM) Guest joined.

(1:19 PM) PavithraKenjige(PK Technologies) joined.

(1:19 PM) Alexandra Wood joined.

(1:20 PM) William Vorhies (Predictive Modeling, LLC.): Please increase the size of the PPT image on the shared desktop.

(1:20 PM) Guest disconnected.

(1:21 PM) PavithraKenjige(PK Technologies) disconnected.

(1:22 PM) William Miller (MaCT USA) disconnected.

(1:22 PM) Pw Carey, (Compliance Partners, LLC): You should be able to adjust the size via the 25% to 200% sliding scale at the bottom of the screen....

(1:23 PM) Carl Buffington (Vistronix) joined.

(1:23 PM) PavithraKenjige (PK Technologies) joined.

(1:23 PM) William Miller (MaCT USA) joined.

(1:24 PM) Alexandra Wood disconnected.

(1:24 PM) Alexandra Wood joined.

(1:29 PM) Chaitan Baru (SDSC) disconnected.

(1:29 PM) Chaitan Baru (SDSC) joined.

(1:33 PM) Angelo Calvache GE joined.

(1:34 PM) PavithraKenjige (PK Technologies) 71 joined.

(1:34 PM) PavithraKenjige (PK Technologies) disconnected.

(1:37 PM) Felix/COMINT joined.

(1:39 PM) Charlie disconnected.

(1:39 PM) Charlie joined.

(1:39 PM) Karen G: Excellent that Census data is included.

(1:43 PM) Pw Carey, (Compliance Partners, LLC): Are the 'current' list of Use Cases in DRAFT MODE....(aka: open to review & improvement)....?

(1:43 PM) nancy landreville joined.

(1:43 PM) Joules Technology Inc.: Making progress. Excellent!

(1:48 PM) Pw Carey, (Compliance Partners, LLC): Wow.....this is impressive...

(1:48 PM) Pw Carey, (Compliance Partners, LLC): No just a general query....

(1:48 PM) Wo Chang (NIST): I see.

(1:48 PM) Pw Carey, (Compliance Partners, LLC): Now back to Wow!.....

(1:48 PM) Karen G: I'd just think, good to have other group members review and comment.... that goes for all of the deliverables. As time permits. :)

(1:50 PM) Joules Technology Inc.: Agreed.

(1:52 PM) Geoffrey Fox: To Karen: Yes NARA produced 2 nice government use cases

(1:53 PM) Karen G: Will there be a 1:1 mapping from the use cases to the security / privacy reqts? That would be ideal; perhaps that isn't feasible in the timeframe?

(1:54 PM) Joules Technology Inc.: Has there been a consideration to include a continuous improvment element in the Security Privacy Framework

(1:54 PM) Joules Technology Inc.: ?

(1:55 PM) Karen G: Good to see HIPAA cited - that's mandatory in the health IT space, of course.

(1:56 PM) Alexandra Wood disconnected.

(1:56 PM) Chaitan Baru (SDSC): Geoffrey, we have a diabetes use case from UCSD. Am asking those folks to add to your use case, if they can.

(1:57 PM) Chaitan Baru (SDSC): i might be muted...

(1:58 PM) Geoffrey Fox: I think we should be able to relate SEc & Privacy to Req. Use Cases. Some verticals like e-commerce and Healthcare have major SecPrivacy issues

(1:59 PM) Charlie disconnected.

(1:59 PM) Geoffrey Fox: Chaitan. That would be great -- please either augment current use case or submit separately

(2:00 PM) Ben Kobler disconnected.

(2:04 PM) Joules Technology Inc.: Excellent!

(2:04 PM) Pw Carey, (Compliance Partners, LLC): To Joules question: Taxonomy development typically goes through different stages of a development lifecycle. 1. Gather Requirements 1.2. Analyze 2.3. Develop 3.4. Pilot 4.5. Refine 5.6. Roll Out 6....taxonomy development is usually an iterative process....so it is good to confirm our approach.....

(2:05 PM) Pw Carey, (Compliance Partners, LLC): With the end product being a living document.....

(2:06 PM) lisa martinez joined.

(2:08 PM) Angelo Calvache GE: To Joules question: continous improvement...does this relate to continuity/sustainability of the final Big Data model, document (system quality)? Or are you refering to a continual improvement such as future enhancements improvements etc.?

(2:11 PM) Pw Carey, (Compliance Partners, LLC): From my view it's future enhancements and improvements.....

(2:12 PM) Pw Carey, (Compliance Partners, LLC): Unfortunately we must break away....but please send an email for follow up ....thanks much and good meeting, Respectfully yours, Pw

(2:13 PM) lisa martinez53 joined.

(2:13 PM) Joules Technology Inc.: Will do. Thanks for your response Pw Carey

(2:13 PM) Pw Carey, (Compliance Partners, LLC) disconnected.

(2:15 PM) Chaitan Baru (SDSC): I am posting to public chat what i had sent in private chat to Nancy Grady. Since one aspect of big data might be going from "data to decisions", traceabiilty and provenance will be an important requirement all around.

(2:15 PM) lisa martinez disconnected.

(2:16 PM) William Miller (MaCT USA) disconnected.

(2:17 PM) Joules Technology Inc. disconnected.

(2:20 PM) PavithraKenjige (PK Technologies) 71 disconnected.

(2:24 PM) PavithraKenjige (PK Technologies) joined.

(2:26 PM) Bob Marcus (ET-Strategies): Suggested additions to Deliverables are Guidance for users (Example M0149) and User Q&A (Example M0079)

(2:27 PM) Vivek Navale disconnected.

(2:27 PM) Angelo Calvache GE disconnected.

(2:28 PM) Dan McClary (Oracle) disconnected.

(2:28 PM) Luca Lepori: Agreed, Bob!

(2:28 PM) Karen G: Bob, I completely agree wrt having an executive overview. I'd combine that with the

(2:29 PM) Karen G: notion of a q&a - that is, here is what each section addresses.

(2:29 PM) Vivek navale joined.

(2:30 PM) Karen G: Yes - we talked about TechAmerica at the in-person mtg in January. It was well received as I recall.

(2:32 PM) William Vorhies (Predictive Modeling, LLC.) disconnected.

(2:33 PM) Pw Carey, (Compliance Partners, LLC) joined.

(2:34 PM) Pw Carey, (Compliance Partners, LLC): FAQ....yep....sounds good....

(2:35 PM) Pw Carey, (Compliance Partners, LLC): & Guidance to Users/Readers....could we charge them for this SERVICE....?

(2:35 PM) Vivek navale disconnected.

(2:35 PM) Karen G: I'd suggest - rather than an FAQ - introduce each topic with a paragraph or two.

(2:36 PM) Dave Raddatz (SGI) disconnected.

(2:36 PM) Pw Carey, (Compliance Partners, LLC): Nice idea....Karen...but FAQ is a pretty common way to highlight what we think the audience would like to know right away....

(2:36 PM) Karen G: And have an overall exec summary / intro at start of the document / white paper. [If we neeed a q&a, perhaps fram it as for more info.

(2:37 PM) Karen G: OK, fine.

(2:37 PM) Pw Carey, (Compliance Partners, LLC): But your idea is still good.....Pw

(2:37 PM) William Miller (MaCT USA) joined.

(2:37 PM) Karen G: Sure. I'm happy with the group's decision. No worries.

(2:38 PM) Pw Carey, (Compliance Partners, LLC): We like an 'Executive Summary'....

(2:38 PM) Pw Carey, (Compliance Partners, LLC): Wait....we have an 'Executive Summary'....

(2:38 PM) Pw Carey, (Compliance Partners, LLC): Go figure.....

(2:40 PM) Steve Cotton disconnected.

(2:40 PM) Luca Lepori: So how about an infographic that summarizes the executive summary and visually conveys the FAQ? Kidding...

(2:41 PM) Steve Cotton joined.

(2:42 PM) Pw Carey, (Compliance Partners, LLC): We'll get on it right away...right after our trip to some place exotic....

(2:42 PM) Karen G: Luca, I could create a tag cloud for that - though, not an infographic. :)

(2:42 PM) Pw Carey, (Compliance Partners, LLC): Uh oh.....starting to sound like 'feature creep'....although there's nothing wrong with that.....

(2:43 PM) Luca Lepori: Haha - like that idea, PW. Tag cloud could be useful. If this was a startup, what would we do to make sure people would "get" the message and the info?

(2:43 PM) Pw Carey, (Compliance Partners, LLC): Thank gaud, my words are invisible...sorta....

(2:44 PM) David Chesnut disconnected.

(2:45 PM) Pw Carey, (Compliance Partners, LLC): We'll sit down and be quite...now....

(2:56 PM) Karen G: As the subteams' deliverables are close to final, it would be great to notify other subteams that the docs are ready for review & comment. Also as Wo mentioned, will need to harmonize the deliverables - that will be a process.

(2:59 PM) William Miller (MaCT USA) disconnected.

(2:59 PM) Chaitan Baru (SDSC): i am online, but on the phone

(2:59 PM) Steve Cotton disconnected.

(2:59 PM) Luca Lepori: I have to go now - thank you all

(2:59 PM) Chaitan Baru (SDSC): not sure how to unmute the phone

(2:59 PM) PavithraKenjige (PK Technologies) disconnected.

(2:59 PM) Pw Carey, (Compliance Partners, LLC): We think you covered it for now....(or, rather, at this point in time....) Respectfully yours, Pw

(2:59 PM) Bob Marcus (ET-Strategies): *6 unmutes phone

(3:00 PM) Nancy Grady (SAIC): is registration up yet for the Sep 30? It's not on the allevents list on the main site

(3:00 PM) Luca Lepori disconnected.

(3:00 PM) Geoffrey Fox: yes its ready

(3:00 PM) Karen G: To Nancy's question, yes, I was able to register for the Sept 30 mtg

(3:02 PM) Nancy Grady (SAIC): anyone have the pointer?

(3:02 PM) Yuri Demchenko (UvA) disconnected.

(3:03 PM) Pw Carey, (Compliance Partners, LLC): Thank you...good meeting....Pw

(3:03 PM) lisa martinez53: thank you all

(3:03 PM) Tim Zimmerlin (Automation Technologies): Great progress all!

(3:03 PM) Tim Zimmerlin (Automation Technologies) disconnected.

(3:03 PM) Carl Buffington (Vistronix) disconnected.

(3:04 PM) Orit Levin (Microsoft) disconnected.

(3:04 PM) David Bruggeman disconnected.

(3:04 PM) Karen G disconnected.

(3:04 PM) Brand Niemann disconnected.

(3:04 PM) Pw Carey, (Compliance Partners, LLC) disconnected.

(3:04 PM) Wo Chang (NIST): Registration: https://www-s.nist.gov/CRS/conf_disclosure.cfm?&conf_id=6552

(3:04 PM) paul savitz disconnected.