Programme And Abstracts For Monday 27th Of November
Keynote: Monday 27th 9:40 Mantra
Agricultural And Agri-Environmental Statistics With Support Of Geospatial Information, Methodological Issues
Elisabetta Carfagna
University of Bologna
Agri-environmental trade-offs are issues critical for policy makers charged with managing both food supply and the sustainable use of the land. Reliable data are crucial for developing effective policies and for evaluating their impact. However, often the reliability of agricultural and agro-environmental statistics is low.
Due to the technological development, in the last decades, different kinds of geospatial data have become easily accessible at decreasing prices and have started to be an important support to statistics production process.
In this paper, we focus on methodological issues related to the use of geospatial information for sampling frame construction, sample design, stratification, ground data collection and estimation of agricultural and agri-environmental parameters. Particular attention is devoted to the impact of spatial resolution of data, change of support aggregation and disaggregation of spatial data, when remote sensing data, Global Positioning Systems and Geographic Information Systems (GIS) are used for producing agricultural and agro-environmental statistics.
Monday 27th 11:00 Narrabeen
Developing A Regulatory Definition For The Authentication Of Manuka Honey
Claire McDonald1, Suzanne Keeling1, Mark Brewer2, and Steve Hathaway1
1Ministry for Primary Industries
2BioSS
Manuka honey is a premium export product from New Zealand that has been under scrutiny due to claims of fraud, adulteration and mislabelling. Although there are several industry approaches for defining manuka honey, there is currently no scientifically robust definition suitable for use in a regulatory setting. As such, ensuring the authenticity of manuka honey is challenging.
Here we present the results of a three year science programme which developed scientifically robust definitions for monofloral and multifloral manuka honey produced in New Zealand. The programme involved: selecting appropriate markers to identify honey sourced from Leptospermum scoparium (manuka), establishing plant and honey reference collections, developing test methods to determine the levels of the markers and analysing the data generated to develop the definitions.
The suitability of 16 markers (chemical and DNA-based) were evaluated for use in a regulatory definition for manuka honey. Plant samples were collected from two flowering seasons representing both manuka and non-manuka species from both New Zealand and Australia. Honey samples, also representing manuka and non-manuka floral types, were sourced from seven New Zealand production seasons. Additionally, honey samples were sourced from another 12 countries to enable comparison. All samples were tested for the markers being evaluated using the developed test methods.
The method of CART (Classification and Regression Trees) was used to develop the monofloral and multifloral manuka honey definitions. The CART outputs were further processed using a simulation approach to determine the sensitivity and the robustness of the definitions. The definitions use a combination of 5 markers (4 chemical and 1 DNA) at set thresholds to classify a sample as manuka honey or otherwise. We discuss the practicalities of using the science-based definitions within a regulatory context.
Monday 27th 11:00 Gunnamatta
On Testing Random Effects In Linear Mixed Models
Alan Welsh1, Francis Hui1, and Samuel Mueller2
1ANU
2University of Sydney
Monday 27th 11:20 Narrabeen
Analysing Digestion Data
Maryann Staincliffe, Debbie Frost, Mustafa Farouk, and Guojie Wu
AgResearch
Monday 27th 11:20 Gunnamatta
A Permutation Test For Comparing Predictive Values In Clinical Trials
Kouji Yamamoto and Kanae Takahashi
Osaka City University
Screening tests or diagnostic tests are important for early detection and treatment of disease. There are four well-known measurements, sensitivity (SE), specificity (SP), positive predictive value (PPV) and negative predictive value (NPV) in diagnostic studies. For comparing SEs/SPs, McNemar test is widely used, but there are only few methods for the comparison of PPVs/NPVs. Moreover, all of these methods are based on large-sample theory.
So, in this talk, firstly, we investigate the performance of those methods when the sample size is small. In addition, we propose a permutation test for comparing two PPVs/NPVs we can apply even if the sample size is small. Finally, we show the performance of the proposed method with some existing methods via simulation studies.
Monday 27th 11:40 Narrabeen
Challenges And Opportunities Working As A Consulting Statistician With A Food Science Research Group
M. Gabriela Borgognone
Queensland Department of Agriculture and Fisheries
When an established research group has been functioning for many years without a statistician as an integral part of the team, welcoming one into the group can present challenges as well as opportunities for all involved.
Challenges for the research group include, for example, involving the statistician at the beginning of the study instead of once the experiments have been completed and the data collected; acquiring or increasing knowledge of experimental design principles; understanding the limitations of some statistical analyses, expanding the range of methods they feel familiar with, and learning when/how to apply each one; and improving the presentation of results in this era where poor presentation is perpetuated by the general lack of sound statistical methods in the literature of the research area. Challenges for the statistician include, for example, overcoming his/her lack of general knowledge of the underlying scientific area and its specific vocabulary; determining what experimental designs would work from a practical point of view; developing understanding of their scientific questions, data management practices, and types of data collected; navigating the various software they use and checking their adequacies and limitations; and, above all, communicating with patience and perseverance.
Correspondingly, all challenges present opportunities for improvement and collaboration between scientists and statisticians. Working as a team supports a decision making process that is relevant to industry and that is based on good statistical practices. Additionally, it helps scientists become more statistically aware and empowered. A bit more than a year ago I started working as a consulting statistician with a food science research group. In this presentation I will share some of the challenges and the opportunities to incorporate good statistical practice I have identified, as well as some of the improvements we have made so far working together in this partnership.
Monday 27th 11:40 Gunnamatta
Robust Semiparametric Inference In Random Effects Models
Michael Stewart1 and Alan Welsh2
1University of Sydney
2ANU
Monday 27th 12:00 Gunnamatta
Robust Penalized Logistic Regression Through Maximum Trimmed Likelihood Estimator
Hongwei Sun, Yuehua Cui, and Tong Wang
Shanxi Medical University
Keynote: Monday 27th 13:30 Mantra
A Multi-Step Classifier Addressing Cohort Heterogeneity Improves Performance Of Prognostic Biomarkers In Complex Disease
Jean Yang
University of Sydney
Recent studies in cancer and other complex diseases continue to highlight the extensive genetic diversity between and within cohorts. This intrinsic heterogeneity poses one of the central challenges to predicting patient clinical outcome and the personalization of treatments. Here, we will discuss the concept of classifiability observed in multi-omics studies where individual patients’ samples may be considered as either hard or easy to classify by different platforms, reflected in moderate error rates with large ranges. We demonstrate in a cohort of 45 AJCC stage III melanoma patients that clinico-pathologic biomarkers can identify those patients that are most likely to be misclassified by a molecular biomarker. The process of modelling the classifiability of patients was then replicated in independent data from other diseases.
A multi-step procedure incorporating this information not only improved classification accuracy overall but also indicated the specific clinical attributes that had made classification problematic in each cohort. In statistical terms, our strategy models cohort heterogeneity via the identification of interaction effects in a high dimensional setting. At the translational level, these findings show that even when cohorts are of moderate size, including features that explain the patient-specific performance of a prognostic biomarker in a classification framework can significantly improve the modelling and estimation of survival, as well as increase understanding.
Monday 27th 14:20 Narrabeen
How To Analyse Five Data Points With Fun
Pauline O’Shaughnessy1, Stephen Robson2, and Louise Rawlings2
1University of Wollongong
2ANU
Monday 27th 14:20 Gunnamatta
An Approach To Poisson Mixed Models For -Omics Expression Data
Irene Suilan Zeng and Thomas Lumley
University of Auckland
Monday 27th 14:40 Narrabeen
Bayesian Spatial Estimation When Areas Are Few
Aswi Aswi, Susanna Cramb, Earl Duncan, and Kerrie Mengersen
Queensland University of Technlogy
Monday 27th 14:40 Gunnamatta
Knowledge-Guided Generalized Biclustering Analysis For Integrative –Omics Analysis
Changgee Chang1, Yize Zhao2, Mingyao Li1, and Qi Long1
1University of Pennsylvania
2Cornell University
Monday 27th 15:00 Narrabeen
Understanding The Variation In Harvester Yield Map Data For Estimating Crop Traits
Dean Diepeveen1, Karyn Reeves1, Adrian Baddeley1, and Fiona Evans2
1Curtin University
2Murdoch University
Monday 27th 15:00 Gunnamatta
Citizen Science To Surveillance: Estimating Reporting Probabilities Of Exotic Insect Pests
Peter Caley1, Marijke Welvaert2, and Simon Barry1
1CSIRO
2University of Canberra
Up until mid-2016, citizen science uploads to the Atlas of Living Australia included c. 400 bug species, and c. 1,000 beetle species. Given the short time period (c. 3 years) over which most of these records have accumulated, this represents a considerable reporting effort. The key applied question from a biosecurity context is how this level of reporting translates to the detection and reporting of exotic insect pests in the event of an incursion.
We use a case-control design to model the probability of existing insect species being reported via citizen science channels feeding into the Atlas of Living Australia. The effect of insect features (size, colour, pattern, morphology) and geographic distribution on reporting rates are explored as explanatory variables. We then apply the model to exotic high priority pest species to predict their reporting rates in the event of their introduction.
Monday 27th 15:50 Narrabeen
Introduction To “Deltagen” - A Comprehensive Decision Support Tool For Plant Breeders Using R And Shiny
Dongwen Luo and Zulfi Jahufer
AgResearch
Monday 27th 15:50 Gunnamatta
Estimating Nitrous Oxide Emission Factors
Alasdair Noble and Tony Van Der Weerden
AgResearch