Building a Source Code Mining Tool Using Java and Solr by Gary Sieling

Sponsored By

ABSTRACT: Building a Source Code Mining Tool Using Java and Solr

Mining source code repositories for useful insights is a practical way for developers to experiment with readily available data. One such problem is determining which developers have worked the most with particular clients or tools. In an organization that has been around for a while, information is often spread across different locations (e.g. CVS, subversion, Git, old emails, Sharepoint)- each require different query tactics. This talk demonstrates how to build a Java application which indexes Git repositories in a Solr full-text index, providing useful analytics through faceted search.

We found that this tool could identify which engineers worked on different projects fairly accurately. While this is already well-known information within the organization, it provides a useful demonstration of configuring full-text search for developers interested in the subject.

solrmtg

SPEAKER BIO: Gary Sieling

Gary Sieling is a Sr. Software Engineer at Wingspan Technology. He focuses on delivering large enterprise applications into complex client environments. In past projects, he’s worked on data-warehouse backed products, and is proficient in a wide range of tools, including Java, ExtJS, Oracle, and Postgres. Gary holds a BS in computer science degree from the Rochester Institute of Technology. He writes regularly at www.garysieling.com, and is also a regular contributor on architects.dzone.com.

MEETING SLIDES: Slides are available on Gary’s site here

Swag courtesy of Stack Overflow and Murach

Functional Wrappers for Legacy API’s by Martin Snyder

Sponsored By

ABSTRACT: Functional Wrappers for Legacy API’s

There are many challenges in introducing a new technology into an existing project or organization. This presentation details an approach to introducing Scala into JDBC applications, but the lessons and techniques can be applied in a general sense to introducing functional programming into any non-functional project. Specifically, a line-by-line construction is provided of a Scala wrapper for portions of JDBC. The chosen example serves to highlight the mechanics, benefits and underlying principles of functional programming in such a way that they can be generally applied.

SPEAKER BIO: Martin Snyder

Martin Snyder (@martinsnyder) is a Principal Technologist at Wingspan Technology, Inc. He is responsible for the software product organization and is a Senior Architect for the DocWay Suite and eTMF products. Previously, he was the founder of Ethermoon Entertainment, Inc, which published the RTS game Strifeshadow in 2001. Martin earned a BS in Computer Science from Cornell Engineering.

MEETING SLIDES: Video of the presentation can be found here. Martin’s slides on GitHub here and the code on GitHub is here

Snyderlive2 Snyderlive1

Mechanical Sympathy by Jim Carroll

Sponsored By

ABSTRACT: Mechanical Sympathy

“Mechanical Sympathy” is a phrase (as well as a blog) applied by Martin Thompson to how to write software that cooperates with the underlying CPU architecture. We will discuss how to use “mechanical sympathy” to get high-performance multi-threading behavior in Java up to 100s of times greater than using traditional locking or by using the concurrency classes by using “lock-free” techniques that take into account how “memory barriers” effect performance.

Filtered for your protection

SPEAKER BIO: Jim Carroll

Jim Carroll is a Navteq Fellow at Nokia, Location and Commerce. He is the Lead Architect for the Traffic Products and is currently the project lead for Dempsy, Nokia’s Stream-based Real-time Map-Reduce system that was just open sourced. Jim earned a Master’s Degree from Villanova University.

MEETING SLIDES:

Mechanical Sympathy (updated PDF)

CODE SAMPLE:

MechanicalSympathy.java (zip file)

Big Data Quadfecta (Kafka, Elastic Search, Cassandra, Storm)

Sponsored By

ABSTRACT: Big Data Quadfecta

A successful Big Data platform combines distributed processing and polyglot persistence into a single cohesive infrastructure. Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Hear how we applied event processing and NoSQL to deliver real-time analytics, while accommodating structural change over time, and fuzzy/geospatial search. Oneill Quadfecta pic

SPEAKER BIO: Brian O’Neill

ONEILL Brian O’Neill (@boneill42) is Lead Architect at Health Market Science (HMS) where he heads design and development of their Master Data Management (MDM) solution and Big Data platform that targets the Healthcare space, powered by Storm and Cassandra. He leads and contributes to multiple open-source projects that extend Cassandra and integrate it with full-text indexing engines and event processing frameworks. Brian is author of the Dzone reference card on Cassandra, and was selected as a Datastax MVP for Cassandra. In the past, Brian has contributed to expert groups within the JCP and has patents in artificial intelligence and context-based discovery. He holds a B.S. in C.S. from Brown University.

MEETING SLIDES: On Slideshare or Quadfecta PDF

The audience got smarter that day.

Tim Anglade’s Build a Store Locator App in 40 Minutes with HTML5, PhoneGap, and an API-Based Back-End

Sponsored By

ABSTRACT: Build a Store Locator App in 40 Minutes with HTML5, PhoneGap, and an API-Based Back-End

Building a mobile app is hard… Right? It doesn’t have to be! This hands on demo will walk you through the build of a full, app store-ready app in 40 minutes or less. Together we’ll code a store locator app from scratch, showing you the 5 stores closest to you, in fewer than 100 lines of code!

Join Tim Anglade for this walkthrough and walk away with the know how to create a full mobile app – ready for the app store in less than an hour.
There will be a handout with links to the free utilities used in the demo and a recording sent so you can do it yourself.

Tim will cover:
·         Building mobile apps with HTML5 and Adobe PhoneGap
·         Geolocation and context-aware app features
·         Developing & deploying apps easily with Apigee
SPEAKER BIO: Tim Anglade

anglade Tim Anglade is Head of Developer Programs for Apigee, the API company. In previous lives he was the CTO of a small startup, a grad. school lecturer, and an invited expert at the W3C. He enjoys long walks on the beach and hates writing about himself in the third person.

MEETING SLIDES:

Link to Slides
Store Locator Demo on GitHub
Store Locator Code on GitHub
Getting Started on Android from Apigee’s GitHub
Webcast Video

Sacha Labourey on PaaS for Java Developers

Sponsored By

ABSTRACT: Platform as a Service: Transforming the lives of Java Developers

Java developers today face significant hurdles and delays in building and deploying new innovative applications quickly and Sacha efficiently. So how can middleware in the cloud in the form of a PaaS (Platform as a Service) make their lives simpler and easier? Sacha Labourey, CEO of CloudBees and former CTO of JBoss, will discuss how PaaS can make developers more productive, creative, empowered and responsible for their ultimate objective: creating value for the business by focusing 100% of their time on developing business-critical applications.

Discussion Topics:

Life as a developer today
The future is based on Service not Software
Continuous Integration – Continuous Deployment
Any Cloud: Public, Private or Hybrid
Software development in the cloud era
The CloudBees PaaS

SPEAKER BIO: Sacha Labourey

Sacha Labourey Sacha was born in Switzerland and graduated in 1999 from EPFL. It was during Sacha’s studies in 1996 that he started his first consulting business – Cogito Informatique. In 2001, he joined Marc Fleury’s JBoss project as a core contributor and implemented JBoss’ original clustering features. In 2003, Sacha founded the European headquarters for JBoss and, as GM for Europe, led the strategy and partnerships that helped fuel the company’s growth in that region. While in this position, he led the recruitment of some of JBoss’ key talent and acquisition of key technology. In 2005, he was appointed CTO of JBoss, Inc. and as such, oversaw all of the JBoss engineering activities. In June 2006, JBoss, Inc. was acquired by Red Hat (NYSE:RHT). After the acquisition, Sacha remained JBoss CTO and played a crucial role in integrating and productizing JBoss software with Red Hat offerings. In 2007, Sacha became co-General Manager of Red Hat’s middleware division. He ultimately left Red Hat in April 2009. Following a period of research, Sacha became convinced that public cloud infrastructure would lead a fundamental IT paradigm shift and that middleware would play a key role in that shift. As a result CloudBees, Inc. was formed in April 2010.

MEETING SLIDES: PaaS for Java Developers PDF

crowd photo

Hadoop and Mahout by Anton Slutsky

Sponsored by

Connexin Software

ABSTRACT: Hadooop and Mahout

In recent years, cheap commodity storage hardware and cloud enabled companies to collect and accumulate vast amounts of information. The need for understanding and making use of this information has given birth to the concept of Big Data and necessitated the need for distributed processing frameworks among which Hadoop is a current leader. While Hadoop provides a platform for Big Data processing, in and of itself it does not expose facilities for pattern and knowledge discovery in data. Add knowledge and pattern discovery capabilities to Hadoop is the goal of the Apache Mahout project, which exposes a set of cutting edge Machine Learning and Artificial Intelligence libraries designed for the Hadoop platform.

Learning was done, pizzas were eaten.

In this talk, I am going to present a use case of using Machine Learning in the Big Data context and walk through an example of addressing the use case using Mahout and Hadoop. The walk-through will assume basic understanding of Hadoop installation and configuration and will focus on challenges associated with using the Mahout product, which (while providing an excellent collection of machine learning algorithms) is not trivial to set up and configure.

SPEAKER BIO: Anton Slutsky

Anton Slutsky is an experienced information technology professional with over a decade of experience in the field. He has a Masters degree in Computer Science from Villanova University and is currently working on his PhD at Drexel University with published research works in the area of Artificial Intelligence, Machine Learning and Data Mining. Prior to his current position as Chief Science Officer at Zelant Software, Inc, Anton led engineering efforts at the Oracle and BEA Systems. Currently, Anton is involved in promoting the concept of embedded analytics – an approach meant to operationalize Big Data using cutting edge machine learning and data mining techniques and research findings.

MEETING SLIDES
hadoop+mahout PDF Hadoop + Mahout code zip file

PhillyJUG

talking Java + killing pizza since 2000

Building a Source Code Mining Tool Using Java and Solr by Gary Sieling

Functional Wrappers for Legacy API’s by Martin Snyder

Mechanical Sympathy by Jim Carroll

Big Data Quadfecta (Kafka, Elastic Search, Cassandra, Storm)

Tim Anglade’s Build a Store Locator App in 40 Minutes with HTML5, PhoneGap, and an API-Based Back-End

Sacha Labourey on PaaS for Java Developers

Hadoop and Mahout by Anton Slutsky

Connexin Software

Follow “PhillyJUG”