blob: 4aadfad1318bb77423086d7c3ca5dd3537cb9443 [file] [log] [blame]
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
A BRIEF OUTLINE OF THE DESIGN OF THE NEW OUTPUT WINDOW
The new output window is designed to solve several problems:
- Accessibility/Usability (uses standard swing text pane rather than a terminal emulator)
- Performance - minimize heap use, memory copies and event queue flooding
At its core, the output window is a javax.swing.text.Document which gets its data directly
from a memory mapped file (PENDING: heap based storage is implemented and may be more
efficient for small amounts of output). Fetching IOProvider instances and InputOutput objects, and
writing to them is thread safe.
The classes of interest are these, interfaces implemented in brackets:
IO/INFRASTRUCTURE:
- NbIOProvider [IOProvider]- does little except provide instances of NbIO and track them by name
- NbIO [InputOutput] - Constructs/manages the lifecycle of an OutWriter; forwards calls setters that
affect GUI presentation on the event queue.
- OutWriter [OutputWriter] - handles asynchronous reads and writes to the data storage,
creating it on first use and retaining it until it is disposed (which happens when an
output tab is explicitly closed by the user if the tab is not being written to). Each write call creates a new
direct allocated ByteBuffer; when a write is completed the buffer is "disposed",
causing it to be written to the memory mapped file (this really meaning that the memory
it references becomes a section of the memory mapped file). For concurrent write calls, the order
is non-deterministic, just as it is in a shell.
Reading is accomplished via FileChannel and ByteBuffer.asCharBuffer(), and is synchronized
against new writes.
Change notifications are handled via simple support for ChangeListeners. Changes are
fired only when the file is flushed or closed, and on the first write to the file.
For general repainting, a polling mechanism is supported - on each write, a dirty flag
is set, which the Document implementation can check (clearing it) by calling checkDirty().
Change notifications are posted from the thread causing the change.
- Storage - an interface to abstract the byte-based data store the OutWriter reads and writes
from. There are two implementations, FileMapStorage and HeapStorage.
- OutputDocument - This is an implementation of the java text Document interface directly
over the read methods of OutWriter. A number of optimizations are possible due to the
fact that the file will only grow, only at the end and never shrink (if a writer is reset
due to OutputWriter.reset(), a new OutputDocument should be constructed).
Change and updates are handled by generating DocumentEvents as follows: On the
initial change (first write) event a timer is started, initially running at 500ms intervals, and slowing
down if no changes have happened for 15 seconds. When the timer fires, the following
sequence of events happens:
- Check if the file is dirty or has been closed, if not, return. If closed, stop the timer.
- Check if an event has been previously posted which has not been asked for its
data. If yes, return
- Create a new OutWriter.DO, which implements DocumentEvent and post it on the
event queue
- OutputDocument.DO [DocumentEvent] - this is a somewhat unusual event implementation.
What happens is this: An event is created knowing only the end position of the last
posted event (or 0). It will have no other information until something calls one of its
methods. At that point it briefly synchronizes on the writer and gets the *current*
line count and number of characters written, storing those values. In this way, we do
not generate events unless 1. There is something new to report, and 2. There is no
unprocessed event capable of reporting it.
The output file is created in the system temp directory, and set to be deleted on JVM exit.
For throughput reasons, it is in UTF-16 format for its lifetime. A Save As function is supported,
which will copy the file to a user-specified location, converting its encoding in the process.
WORD WRAPPING
Doing effective, fast-performing word wrapping is harder than it seems - making it scale
is the biggie. So this is heavily optimized. Main points:
- javax.swing.text's WrappedPlainView (what you get if you set word wrap on a JTextArea)
is absolutely out - to figure out where to put the caret, it will iterate every char
in the document, to expand tabs & such). Solution: Don't expand tabs, and don't use
WrappedPlainView.
- There are a bunch of optimizations possible because of a couple things:
- The document will never change except at the end
- We know we're always using a fixed width font
Consider that, if you want to know what position in the document a wrapped character position
on a component is, you need to iterate from the top of the document down, counting wrapped lines
until you find the one you're on. Once you hit a few hundred thousand lines, it can take a
second or so for the caret to move when you click!
What we do is, above a small output file, we cache the number of logical lines
*above* each line which is wrapped. For a query of any given line, we find the first line
above it which *is* wrapped, take the cached value and add in the distance to that line.
Caching only those lines which are wrapped keeps the cache nice and small, and so at worst
it's a couple binary searches and integer adds to get the right value.
The cache is initially built the first time something asks for wrap information. If the
document is still being written, it will be added to for any lines that go over the wrap
point.
The achilles heel is that, if the output window width changes, the cache must be rebuilt,
and that can be expensive (though even at 400000 lines it's less than 1s on a reasonable
machine, and not many people are reading 400000 lines of output).
UI ARCHITECTURE
The GUI design is built around pure MVC design and limiting usage of the listener pattern.
It is composed of two layers:
- Base ui classes in org.netbeans.core.output2.ui, which implement things like custom
caret and scrollbar handling, and automatically using a tabbed pane for multiple
components
- Implementation classes - the GUI components contain no output-specific logic - rather
they find (via the component hierarchy) the OutputController which is managing the
parent component; it handles all of the logic
The result of this design is that all information lives in one place only, and all logic
(which can apply to the output window, the output tab, or the text view inside it) is
handled in a single location, by a stateless controller. For everything the gui does,
there is a method on the controller class - much more understandable than smidgins of
logic embedded in a handful of listeners.
THREAD SAFETY
In addition to the above mentioned mechanism for threading between output writes and
document events, Operations on InputOutput[NbIO] (setting the input area visible, causing an output view to become
selected, etc.) are handled by dispatching IOEvents (a simple event class with an integer
ID and a boolean value) on the event thread. Operations invoked from the event thread will
be run synchronously.
CARE AND FEEDING
For writing patches/maintaining this package, there are a few simple rules:
1. There shall be no listeners! Yes it's all doable with listeners, it's just not
maintainable that way. Resist the temptation and do it right.
2. For any information that needs to go back and forth, create a getter or setter on the
GUI component closest to that information, and a set of callbacks through the hierarchy to
the controller (just see what any of the existing calls do)
3. There shall be no dependencies in classes in the ui package on classes in the output2
package (with a minor exception for logging purposes only). Pure GUI logic lives in the
components. Functional logic lives in the controller.
AMBIGUITIES IN THE API AND HOW THEY ARE DEALT WITH
There are a number of ambiguities in the original API, which any implementation must resolve somehow.
They are as follows:
- reset() method should be on InputOutput, but is instead on OutputWriter. Given a single
merged output, there is no meaningful way to handle the situation of error output that has been reset
plus a stdout that has not been reset. Calls to reset() on the error writer are ignored.
- close() method should be on InputOutput, not OutputWriter. There is no meaningful way to
handle a closed error + an unclosed output, so this is handled as follows:
- If getErr() has never been called, the stream will be considered closed when getOut().close()
has been called
- If getErr() *has* been called, the stream will not be considered closed until both
getErr().close() and getOut().close() have been called
- Behavior of writes or other calls after calling closeInputOutput() are undefined. For this
implementation, they are no-ops. Once you have called closeInputOutput(), the InputOutput
is dead for good and should be discarded.
- How to handle output when the user has closed a tab (there is no way to notify the client
that the output component has been closed). If the user closes a tab, the problem flag is
set on the output. Any subsequent writes will go to dev/null until reset() is called, at
which point a tab for the InputOutput may be opened again.
OPEN QUESTIONS:
- Need to measure the overhead of using a memory mapped file by default - for large amounts of
output there is no measurable difference, but there may be a penalty for small ones. To
try heap-based storage, run with -J-Dnb.output.heap=true