Mentors
The following developers have agreed to serve as mentors for the 2015 Google Summer of Code:
Ideas List
Design and Implement a Heap Profiler
A heap profile is a description of the space usage of a program. A heap profile is concerned with the allocation, retention, and deallocation (via garbage collection) of heap data during the execution of a program. A heap profile can be used to diagnose performance problems in a functional program that arise from space leaks. This project aims to design and implement a heap profiler for MLton compiled programs.
Background:
-
GCspy: an adaptable heap visualisation framework; Tony Printezis and Richard Jones
-
New dimensions in heap profiling; Colin Runciman and Niklas Röjemo
-
Heap profiling for space efficiency; Colin Runciman and Niklas Röjemo
-
Heap profiling of lazy functional programs; Colin Runciman and David Wakeling
Recommended Skills: C and SML programming experience; some experience with UI and visualization
Garbage Collector Improvements
The garbage collector plays a significant role in the performance of functional languages. Garbage collect too often, and program performance suffers due to the excessive time spent in the garbage collector. Garbage collect not often enough, and program performance suffers due to the excessive space used by the uncollected garbage. One particular issue is ensuring that a program utilizing a garbage collector "plays nice" with other processes on the system, by not using too much or too little physical memory. While there are some reasonable theoretical results about garbage collections with heaps of fixed size, there seems to be insufficient work that really looks carefully at the question of dynamically resizing the heap in response to the live data demands of the application and, similarly, in response to the behavior of the operating system and other processes. This project aims to investigate improvements to the memory behavior of MLton compiled programs through better tuning of the garbage collector.
Background:
-
The Garbage Collection Handbook: The Art of Automatic Memory Management; Richard Jones, Antony Hosking, Eliot Moss
-
Dual-Mode Garbage Collection; Patrick Sansom
-
Automatic Heap Sizing: Taking Real Memory into Account; Ting Yang, Matthew Hertz, Emery D. Berger, Scott F. Kaplan, and J. Eliot B. Moss
-
Controlling Garbage Collection and Heap Growth to Reduce the Execution Time of Java Applications; Tim Brecht, Eshrat Arjomandi, Chang Li, and Hang Pham
-
Isla Vista Heap Sizing: Using Feedback to Avoid Paging; Chris Grzegorczyk, Sunil Soman, Chandra Krintz, and Rich Wolski
-
The Economics of Garbage Collection; Jeremy Singer, Richard E. Jones, Gavin Brown, and Mikel Luján
-
Automated Heap Sizing in the Poly/ML Runtime (Position Paper); David White, Jeremy Singer, Jonathan Aitken, and David Matthews
-
Control Theory for Principled Heap Sizing; David R. White, Jeremy Singer, Jonathan M. Aitken, and Richard E. Jones
Recommended Skills: C programming experience; some operating systems and/or systems programming experience; some compiler and garbage collector experience
Heap-allocated Activation Records
Activation records (a.k.a., stack frames) are traditionally allocated on a stack. This naturally corresponds to the call-return pattern of function invocation. However, there are some disadvantages to stack-allocated activation records. In a functional programming language, functions may be deeply recursive, resulting in call stacks that are much larger than typically supported by the operating system; hence, a functional programming language implementation will typically store its stack in its heap. Furthermore, a functional programming language implementation must handle and recover from stack overflow, by allocating a larger stack (again, in its heap) and copying activation records from the old stack to the new stack. In the presence of threads, stacks must be allocated in a heap and, in the presence of a garbage collector, should be garbage collected when unreachable. While heap-allocated activation records avoid many of these disadvantages, they have not been widely implemented. This project aims to implement and evaluate heap-allocated activation records in the MLton compiler.
Background:
-
Empirical and Analytic Study of Stack Versus Heap Cost for Languages with Closures; Andrew W. Appel and Zhong Shao
-
Space-efficient closure representations; Zhong Shao and Andrew W. Appel
-
Representing control in the presence of first-class continuations; R. Hieb, R. Kent Dybvig, and Carl Bruggeman
Recommended Skills: SML programming experience; some middle- and back-end compiler experience
Correctly Rounded Floating-point Binary-to-Decimal and Decimal-to-Binary Conversion Routines in Standard ML
The
IEEE Standard for Floating-Point Arithmetic (IEEE 754)
is the de facto representation for floating-point computation.
However, it is a binary (base 2) representation of floating-point
values, while many applications call for input and output of
floating-point values in decimal (base 10) representation. The
decimal-to-binary conversion problem takes a decimal floating-point
representation (e.g., a string like "0.1"
) and returns the best
binary floating-point representation of that number. The
binary-to-decimal conversion problem takes a binary floating-point
representation and returns a decimal floating-point representation
using the smallest number of digits that allow the decimal
floating-point representation to be converted to the original binary
floating-point representation. For both conversion routines, "best"
is dependent upon the current floating-point rounding mode.
MLton uses David Gay’s
gdtoa library for floating-point
conversions. While this is an exellent library, it generalizes the
decimal-to-binary and binary-to-decimal conversion routines beyond
what is required by the
Standard ML Basis Library and induces an
external dependency on the compiler. Native implementations of these
conversion routines in Standard ML would obviate the dependency on the
gdtoa
library, while also being able to take advantage of Standard
ML features in the implementation (e.g., the published algorithms
often require use of infinite precision arithmetic, which is provided
by the IntInf
structure in Standard ML, but is provided in an ad hoc
fasion in the gdtoa
library).
This project aims to develop a native implementation of the conversion routines in Standard ML.
Background:
-
What every computer scientist should know about floating-point arithmetic; David Goldberg
-
How to print floating-point numbers accurately; Guy L. Steele, Jr. and Jon L. White
-
How to read floating point numbers accurately; William D. Clinger
-
Correctly Rounded Binary-Decimal and Decimal-Binary Conversions; David Gay
-
Printing floating-point numbers quickly and accurately; Robert G. Burger and R. Kent Dybvig
-
Printing floating-point numbers quickly and accurately with integers; Florian Loitsch
Recommended Skills: SML programming experience; algorithm design and implementation
Implement Source-level Debugging
Debugging is a fact of programming life. Unfortunately, most SML implementations (including MLton) provide little to no source-level debugging support. This project aims to add basic to intermediate source-level debugging support to the MLton compiler. MLton already supports source-level profiling, which can be used to attribute bytes allocated or time spent in source functions. It should be relatively straightforward to leverage this source-level information into basic source-level debugging support, with the ability to set/unset breakpoints and step through declarations and functions. It may be possible to also provide intermediate source-level debugging support, with the ability to inspect in-scope variables of basic types (e.g., types compatible with MLton’s foreign function interface).
Background:
Recommended Skills: SML programming experience; some compiler experience
Region Based Memory Management
Region based memory management is an alternative automatic memory management scheme to garbage collection. Regions can be inferred by the compiler (e.g., Cyclone and MLKit) or provided to the programmer through a library. Since many students do not have extensive experience with compilers we plan on adopting the later approach. Creating a viable region based memory solution requires the removal of the GC and changes to the allocator. Additionally, write barriers will be necessary to ensure references between two ML objects is never established if the left hand side of the assignment has a longer lifetime than the right hand side. Students will need to come up with an appropriate interface for creating, entering, and exiting regions (examples include RTSJ scoped memory and SCJ scoped memory).
Background:
-
Cyclone
-
MLKit
-
RTSJ + SCJ scopes
Recommended Skills: SML programming experience; C programming experience; some compiler and garbage collector experience
Adding Real-Time Capabilities
This project focuses on exposing real-time APIs from a real-time OS
kernel at the SML level. This will require mapping the current MLton
(or MultiMLton) threading framework
to real-time threads that the RTOS provides. This will include
associating priorities with MLton threads and building priority based
scheduling algorithms. Additionally, support for perdioc, aperiodic,
and sporadic tasks should be supported. A real-time SML library will
need to be created to provide a forward facing interface for
programmers. Stretch goals include reworking the MLton atomic
statement and associated synchronization primitives built on top of
the MLton atomic
statement.
Recommended Skills: SML programming experience; C programming experience; real-time experience a plus but not required
Real-Time Garbage Collection
This project focuses on modifications to the MLton GC to support real-time garbage collection. We will model the real-time GC on the Schism RTGC. The first task will be to create a fixed size runtime object representation. Large structures will need to be represented as a linked lists of fixed sized objects. Arrays and vectors will be transferred into dense trees. Compaction and copying can therefore be removed from the GC algorithms that MLton currently supports. Lastly, the GC will be made concurrent, allowing for the execution of the GC threads as the lowest priority task in the system. Stretch goals include a priority aware mechanism for the GC to signal to real-time ML threads that it needs to scan their stack and identification of places where the stack is shallow to bound priority inversion during this procedure.
Recommended Skills: C programming experience; garbage collector experience a plus but not required