Sold Out

Statistical Methods for the Analysis of High-Dimensional and Massive Data S...

Event Information

Share this event

Date and Time

Location

Location

Day 1 Bldg P, Room P419; Day 2 Bldg P, Room P419

QUT Gardens Point Campus, (Day 1 GP-P419) (Day 2 GP-P419

2 George Street

Brisbane, QLD 4000

Australia

View Map

Event description

Description

Organised by Benoit Liquet, Kerrie Mengersen and Tony Pettitt

“2-day Statistical Methods for the Analysis of High-Dimensional and Massive Data set”

Big data is a fast-growing field and skills in the area are some of the most in demand today. During these two days, we will introduce big data and some of the statistical and mathematical approaches for analysing it. The big problem is that the data is big - the size, complexity and diversity of datasets increases every day. This means we need new solutions for analysing data. The first day tutorial will equip you for working with these solutions by introducing you to selected statistical and machine learning techniques used for analysing large datasets and extracting information. The second day will present talks covering recent novel Statistical and Mathematical approaches to analyse complex large datasets.

The first day, Thursday, is a tutorial style day using R software suitable to analyse High-Dimensional and Massive Data set.

Presenters and tutors include:

  • Professor Benoit Liquet, University of Pau and Pays de L’Adour (E2S), ACEMS (QUT)
  • Dr Pierre Lafaye de Micheaux (UNSW)

Outline program for workshop

Opening Remarks: Kerrie Mengersen, Chief Investigator at ACEMS

Day 1 - Thursday, 24 January

A gentle tutorial style day with R software and libraries.

You will be introduced to:

- Manage massive matrices with shared memory and memory-mapped files with the bigmemory R package

- Divide and Recombine Strategy

- Parallel computing on a virtual machine

- Sparse model for high dimensional data with the glmnet R package

- Lasso and elastic-net linear and logistic models for ultrahigh-dimensional data with the biglasso R package

- Sparse Partial Least squares approaches for big-data with the bigsgPLS R package

This is a hands on workshop. Attendees should have had some experience working with the R language before and should bring along a machine (Windows, Mac, Linux all great) running a recent build of R.

Day 2 - Friday, 25 January

A series of talks linking to Day 1 activities. The participants include:

-Professor Rob Hyndman (Monash)

-Professor Robert Kohn (ANU)

- Professor Matt Wand (UTS)

- Professor Matt Roughan (The University of Adelaide)

- Dr Tamara Broderick (MIT)

- Dr Antonietta Mira (USI)

- Dr Francesco Bartolucci (Perugia University)

- Leah South (QUT)

- Dr Pierre Lafaye de Micheaux (UNSW Sydney)

- Dr Alan Huang (UQ)


PLEASE NOTE: Catering of morning tea, lunch and afternoon tea will be provided on both days. Numbers are limited so please ensure you register your attendance for catering purposes. If for some reason you need to cancel your registration, please ensure you contact the organisers so your place can be allocated to someone else.

Date and Time

Location

Day 1 Bldg P, Room P419; Day 2 Bldg P, Room P419

QUT Gardens Point Campus, (Day 1 GP-P419) (Day 2 GP-P419

2 George Street

Brisbane, QLD 4000

Australia

View Map

Save This Event

Event Saved