BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20240116T191657Z
LOCATION:DEF Concourse
DTSTART;TZID=America/Denver:20231114T100000
DTEND;TZID=America/Denver:20231114T170000
UID:submissions.supercomputing.org_SC23_sess289_spostg107@linklings.com
SUMMARY:Better Data Splits for Machine Learning with Astartes
DESCRIPTION:ACM Student Research Competition: Graduate Poster, ACM Student
  Research Competition: Undergraduate Poster, Posters\n\nJackson Burns (Mas
 sachusetts Institute of Technology (MIT))\n\nMachine Learning (ML) has bec
 ome an increasingly popular tool to accelerate traditional workflows. Crit
 ical to the use of ML is the process of splitting datasets into training, 
 validation, and testing subsets to develop and evaluate models. Common pra
 ctice is to assign these subsets randomly. Although this approach is fast,
  it only measures a model's capacity to interpolate. These testing errors 
 may be overly optimistic on out-of-scope data; thus, there is a growing ne
 ed to easily measure performance for extrapolation tasks. To address this 
 issue, we report astartes, an open-source Python package that implements m
 any similarity- and distance-based algorithms to partition data into more 
 challenging splits. This poster focuses on use-cases within cheminformatic
 s. However, astartes operates on arbitrary vectors, so its principals and 
 workflow are generalizable to other ML domains as well. astartes is availa
 ble via the Python package managers pip and conda and is publicly hosted o
 n GitHub (github.com/JacksonBurns/astartes).\n\nRegistration Category: Tec
 h Program Reg Pass, Exhibits Reg Pass
END:VEVENT
END:VCALENDAR
