in san francisco, CA
100 attending · 16 waiting
in Mountain View, CA
18 attending
in Santa Clara, CA
156 attending
in San Francisco, CA
102 attending
in Foster City, CA
91 attending
in San Francisco, CA
33 attending
in San Francisco, CA
118 attending
in Fremont 94538, CA
63 attending
71 attending
in Sunnyvale, CA
350 attending · 11 waiting
27 Meetup Groups match “MapReduce” near San Mateo, CA
In this class, I will go over the basic tools for big data analysis, namely Hadoop, MapReduce, Hive and Apache Pig. We will go through several examples in order to build an indepth understanding of these different languages and tools. I will provide some simple examples as a starting point and we will work through them using Hadoop, MapReduce, Hive and Pig.
Hive is a scalable data warehouse infrastructure built on top of Hadoop. It provides tools to enable easy data ETL, a mechanism to put structures on the data, and the capability to querying and analysis of large data sets stored in Hadoop files. Hive defines a simple SQL-like query language, called HiveQL, that enables users familiar with SQL to query the data. At the same time, this language also allows programmers who are familiar with the MapReduce fromwork to be able to plug in their custom …
What: Monthly meetings with hadoop contributors on relevant issues. Target Audience: Active contributors to Hadoop HDFS and MapReduce projects. Where: Rotating venues. When: Monthly, normal business hours.
Group for anyone using Apache Hadoop and wanting to be productive with data analysis. In particular with Hue, a Web application providing access to MapReduce, Hive, HDFS, Oozie, Impala, Pig, Sqoop2. Also interested in hearing about use cases, pain-points... about using Hadoop and how the end user experience could be improved. Hue open source project: http://gethue.com
There are no upcoming Meetups.
A Bay Area based Hadoop user group focused on helping Hadoop users share experiences, problems and solutions, as well as learn new skills for building large-scale Hadoop-based systems.
Join us to learn about Hadoop and related technologies, including use cases, new features and exciting networking with other hadopp professionals. See past meetups on YDNTheater Channel on YouTube, presentation slides available on Slideshare.
These are not your typical software engineering events. BASE is focused on emerging technologies and creating events around topics that have never been attempted. Events may be around such topics as Big Data, Machine Learning, Robotics, Nanotechnology, 3D Printing, Synthetic Biology, Artificial Intelligence, Computer History, Computer Vision, Augmented Reality (AR), and using software to create art. BASE creates events only possible in San Francisco and the Bay area with experts from all walks …
There are no upcoming Meetups.
The Apache Cassandra Project develops a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. The DataStax Cassandra SF User Group is a highly awesome meetup group, bringing together cool people distributed across the bay area to talk about our favorite NoSQL implementation, Apache Cassandra.
Today is the age of Scalable Systems and Big Data. Providing a meetup where we can discuss innovation, solutions and have pub nights for drinking and socializing.
Quarterly meetings of Oozie contributors. Users are very welcome to attend, however if space ever becomes an issue we reserve the right to prioritize accordingly.
For Advanced AWS topics that assume deep knowledge of AWS services.
There are no upcoming Meetups.
A meetup in the SF Bay Area about Cascading, Scalding (Scala), Cascalog (Clojure), PyCascading (Python), and other DSLs atop http://cascading.org/
There are no upcoming Meetups.
This is a meetup group for users and contributors to Apache Sqoop (incubating). Sqoop is a system designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational datab …
There are no upcoming Meetups.
Calling all newcomers transitioning to the field of Data Mining and Visualization in South Bay. Lets get-together to code, discuss projects, and materials that can help get jump started in this new and very exciting area. Looking forward to exploring and learning more about this field with you.
There are no upcoming Meetups.
This group is open to all those interested in Apache Cassandra, Big Data, Hadoop, Hive, Solr, Hector, Open Source, NoSQL, DataStax, Apache Pig, and high scalability computing. Let's get together and share what we know.
This is a meetup for Bay Area users of Spark (www.spark-project.org), the high-speed Scala-based cluster programming framework. We'll be rotating among locations in San Francisco, Silicon Valley, and Berkeley. We'll also discuss other Spark-related projects, including the Hive-on-Spark port (Shark), and other new programming tools for big data. The meetup will include introductions to the various Spark features, case studies from current users, best practices for deployment and tuning, and futur …
This meetup is about sharing knowledge & learning the latest developments about statistics, machine learning, math, analytics, parallel algorithms, distributed systems.
There are no upcoming Meetups.
Modern machine learning algorithms like Support Vector Machine, Random Forest and Neural Nets are producing very promising results on large scale datasets. However understanding what is happening inside the black box and the nuanced differences of various implementations can be challenging. Topics and presentations will include open source tools, real-world case studies and an occasional talk by a theorist.
There are no upcoming Meetups.
Big Brains is a speakers series with some of the top minds in Silicon Valley. All are welcome. Just bring your Big Brain along! We'll be hosting at least every other month, if not monthly. Look forward to seeing you at our office for one of our upcoming events in Mountain View.
There are no upcoming Meetups.
We are a Bay Area user group focused on helping Datameer users share experiences, tech tips, solve big data analytic problems and solutions. This group is for those who want to learn and advance their knowledge and use of Datameer. Intended to be casual, this Meetup will encourage knowledge sharing and discussions set 'unconference' style. At the beginning of the meetup, we will showcase a best practice then break out into discussion groups. All Datameer related topics will be welcome.
There are no upcoming Meetups.
There are no upcoming Meetups.
A San Francisco-based Hadoop user group focused on helping Hadoop users share experiences, problems and solutions, as well as learn new skills for building large-scale Hadoop-based systems. Please fill out this short survey to help determine the best date/time for most people to meet - http://bit.ly/ajK26U.
There are demands for good mathematicians to write algorithms that can churn through billions or trillions of data points and show where patterns emerge. The Economist data issue raised this issue as follows: "During the recent financial crisis it became clear that banks and rating agencies had been relying on models which, although they required a vast amount of information to be fed in, failed to reflect financial risk in the real world. This was the first crisis to be sparked by big data—and …
There are meet-ups and there are meet-ups! So, why add another meet-up? Well, we felt that it’s time that there should be a meet-up wholly dedicated to all things around “ Big Data” & “ Cloud Computing”. Mankind has never seen so much data as it is seeing today; Big Data is a serious challenge for all businesses, small and big. Cloud Computing technologies are changing the way we conceptualize, design & develop business solutions. These two buzzwords need to lose their “buzz” status and become …
This group is for everyone interested in real-time access to big data. We welcome developers, data scientists, analysts - everyone trying to figure out how to get access - and analysis - quickly; data in HDFS, S3, NoSQL databases, etc. Why did I form this group? There are a number of interesting Open Source and proprietary solutions trying to provide real-time access to Big Data and a high density of folks here in the Bay Area working on solutions. Industry analysts are saying "this is the year …
There are no upcoming Meetups.
This group covers everything related to distributed data processing and one key focus is Mesos as a platform for creating distributed systems. Topics will include ops and tools for running mesos on cloud infrastructure or in a custom data center, data processing with spark, realtime data processing with storm and how to build custom schedulers.
There are no upcoming Meetups.
For people interested in Apache Mahout machine learning libraries, or interested in learning about machine learning in general.
Get an alert email when new Meetup Groups like this start near you.
You'll get advice, help finding members, and tools to make running a Meetup Group easier.