mirrors/foot - Forgejo: Beyond coding. We Forge.

mirror of https://codeberg.org/dnkl/foot.git synced 2026-02-04 04:06:06 -05:00

Author	SHA1	Message	Date
Daniel Eklöf	804642580e	meson: don't generate pre-compose table when -Dunicode-precompose=false	2020-05-02 18:43:13 +02:00
Daniel Eklöf	d945b68b73	unicode-combine: remove utf8proc dependency We only used utf8proc to try to pre-compose a glyph from a base and combining character. We can do this ourselves by using a pre-compiled table of valid pre-compositions. This table isn't _that_ big, and binary searching it is fast. That is, for a very small amount of code, and not too much extra RO data, we can get rid of the utf8proc dependency.	2020-05-02 17:29:00 +02:00
Daniel Eklöf	8389c76549	unicode-combining: don't limit ourselves to the (western) diacritics blocks	2020-05-02 16:11:51 +02:00
Daniel Eklöf	50543983ad	unicode-combine: only compose if we don't have any other combining characters If the client sent the sequence SAB, where SA does NOT have a composed representation, but SB does, the old code would compose SB and throw away A. This patch fixes this by only allowing a compose if there aren't any pre-existing combining characters.	2020-05-01 20:17:37 +02:00
Daniel Eklöf	cb5f80ec6a	vt: utf8: track combining characters that we failed to compose When we detect a combining character, we first try to compose it with the base character (like before). When this fails, we instead add the combining character to the base cell's combining characters array. The reason for using a composed character when possible is twofold: one, the rendered glyph will look better since it will be a single glyph instead of two separate glyphs (possibly from different fonts(!)). And two, for performance. A composed glyph is a single glyph to render, while a decomposed glyph sequence means the renderer has to render multiple glyphs for a single cell.	2020-05-01 11:52:40 +02:00
Daniel Eklöf	69c3e74498	util.h: new header file defining commonly used macros	2020-05-01 11:46:24 +02:00
Daniel Eklöf	3f3fff768a	vt: lazily reset utf8 in action_utf8_*_entry action_clear() is in the super hot code path. Avoid resetting utf8 state there, as utf8 input is relatively uncommon. Instead, reset it when we explicitly enter any of the utf8 collecting states, as this is exactly the point where we need it.	2020-04-27 15:50:44 +02:00
Daniel Eklöf	d1fc419e34	vt: action_utf8_print: idx is cleared in action_clear()	2020-04-27 15:49:07 +02:00
Daniel Eklöf	4278af99d2	vt: utf8-*-entry: idx is cleared in action_clear()	2020-04-27 15:47:44 +02:00
Daniel Eklöf	e478874dd9	term: remove unneeded utf8.left member	2020-04-27 15:06:23 +02:00
Daniel Eklöf	4283a8c51b	utf8: add support for unicode combining characters This feature lets foot combine e.g. "a\u0301" to "á". We first check if the current character (that we're about to print) is a combining character, by checking if it's in one of the following ranges: * Combining Diacritical Marks (0300–036F), since version 1.0, with modifications in subsequent versions down to 4.1 * Combining Diacritical Marks Extended (1AB0–1AFF), version 7.0 * Combining Diacritical Marks Supplement (1DC0–1DFF), versions 4.1 to 5.2 * Combining Diacritical Marks for Symbols (20D0–20FF), since version 1.0, with modifications in subsequent versions down to 5.1 * Combining Half Marks (FE20–FE2F), versions 1.0, with modifications in subsequent versions down to 8.0 If it is, we check if the last cell appears to contain a valid symbol, and if so, we attempt to compose (combine) the last cell with the current character, using utf8proc. If the result is a combined character, we replace the content in the previous cell with the new, combined character. Thus, if you select and copy the printed character, you would get e.g. "\u00e1" instead of "a\u0301". This feature can be disabled. By default, it is enabled if the utf8proc library is found, but can be explicitly disabled, or enabled, with 'meson -Dunicode-combining=disabled\|enabled'.	2020-04-27 12:13:30 +02:00
Daniel Eklöf	89559d5466	grid: move 'cursor' state from terminal to grid This way, the 'normal' and 'alt' grids have their own cursor state, and we don't need to switch between them.	2020-04-16 18:51:14 +02:00
Daniel Eklöf	d67f437458	mbstate: fix compile warning on systems where mbstate_t isn't an integral An empty initializer still ensures the entire object is zero-initialized.	2020-04-13 11:58:38 +02:00
Daniel Eklöf	1006608093	alt-screen: use a custom 'saved' cursor when switching to alt screen This fixes an issue where we failed to restore the cursor correctly when exiting from the alternate screen, if the client had sent escapes to save the cursor position while inside the alternate screen. This was because we used the same storage for saving the cursor position through escapes, as for saving it when entering the alternate screen. Fix by using a custom variable dedicated to normal <--> alt screen switching.	2020-03-16 12:00:25 +01:00
Daniel Eklöf	4a169f5643	vt: tag cells that were form-feed:ed, to allow correct text reflow To handle text reflow correctly when a line has a printable character in the last column, but was still line breaked, we need to track the fact that the slave inserted a line break here. Otherwise, when the window width is increased, we'll end up pulling up the next line, when we really should have inserted a line break.	2020-02-10 21:54:37 +01:00
Daniel Eklöf	8c32e3ccf0	vt: ensure we never step outside our parameter and sub-parameter arrays We only support 16 parameters, and for each parameter, 16 sub-parameters. If we ever hit that limit (or rather, if the client writes 17 (sub) parameters), log this and stop incrementing the parameter index variable. For performance reason, we implement the following behavior: * We never increment the parameter index past the supported number. This ensures all code accessing the parameter list can do so without verifying the validity of the index. * The first time we see too many parameters, and the first time we see too many sub parameters, log this. Then never log again. Even if we see too many parameters in a completely different escape. This is so that we don't have to keep a "have warned" boolean in the terminal struct, but can use a simple function local static variable.	2020-02-01 19:44:56 +01:00
Daniel Eklöf	07a0c7238c	vt: collect (intermediate): log a warning if user supplied more than two intermediates	2020-02-01 19:29:31 +01:00
Daniel Eklöf	bbb7b60b17	vt: collect (intermediate): log _which_ character we collected	2020-02-01 19:29:14 +01:00
Daniel Eklöf	4a64e4aebc	vt: bug: state machine: csi entry: handle 0x3a/0x3b correctly 0x3a/0x3b are ':' and ';'. These should not only switch to the 'csi param' state, but also be parsed as a parameter. This fixes an issue where a multi-parameter escape with the first parameter omitted was parsed incorrectly - as if the first parameter wasn't there. I.e. "\e[;123r" was parsed as "\e[123r"	2020-01-26 00:44:53 +01:00
Daniel Eklöf	75b8fc52b8	vt: bug: fix check for error from mbrtowc() mbrtowc() returns an unsigned. Need to cast to signed before checking if less than zero. This fixes an issue where invalid utf-8 sequences where treated as valid.	2020-01-23 17:39:25 +01:00
Daniel Eklöf	300f83e66b	term: factor out character printing to new function term_print()	2020-01-20 18:34:32 +01:00
Daniel Eklöf	5a6cbb8c3e	dcs: initial handling of DCS in general Add data structure to term->vt. This structure tracks the free-form data that is passed-through, and the handler to call at the end. Intermediates and parameters are collected by the normal VT parser. Then, when we enter the passthrough state, we call dcs_hook(). This function checks the intermediate(s) and parameters, and selects the appropriate unhook handler (and optionally does some execution already). In passthrough mode, we simply append strings to an internal buffer. This might have to be changed in the future, if we need to support a DCS that needs to execute as we go. In unhook (i.e. when the DCS is terminated), we execute the unhook handler. As a proof-of-concept, handlers for BSU/ESU (Begin/End Synchronized Update) has been added (but are left unimplemented).	2020-01-12 11:55:22 +01:00
Daniel Eklöf	56824e459d	Revert "vt: refactor" This reverts commit `a575204bc7`.	2019-12-20 23:59:23 +01:00
Daniel Eklöf	a575204bc7	vt: refactor	2019-12-20 23:45:21 +01:00
Daniel Eklöf	1bc8562026	vt: visually compact the switch tables	2019-12-20 23:38:16 +01:00
Daniel Eklöf	5a0e27fd6c	vt: remove enum action; add separate functions for each action instead	2019-12-20 23:27:15 +01:00
Daniel Eklöf	032f478661	vt: remove debug assert	2019-12-20 23:26:18 +01:00
Daniel Eklöf	9ad9e4ccaf	vt: use a pointer that we increment, instead of indexing	2019-12-20 23:00:07 +01:00
Daniel Eklöf	914b96cc9a	vt: use break, not continue	2019-12-20 22:13:23 +01:00
Daniel Eklöf	ee8a9674c4	vt: no need to assign to term->vt.state for every input byte	2019-12-20 22:12:35 +01:00
Daniel Eklöf	f36752f4d0	vt: remove dead code	2019-12-20 22:11:35 +01:00
Daniel Eklöf	d29de6f90a	vt: don't special case UTF-8 collect state	2019-12-20 22:10:27 +01:00
Daniel Eklöf	2d79497093	vt: convert SOS/PM/APC string from table lookup to switch	2019-12-20 21:50:54 +01:00
Daniel Eklöf	ad1773d7bc	vt: convert DCS from table lookup to switch	2019-12-20 21:48:04 +01:00
Daniel Eklöf	dca403e100	vt: convert CSI ignore from table lookup to switch	2019-12-20 21:13:06 +01:00
Daniel Eklöf	0d6555bea9	vt: convert CSI intermediate from table lookup to switch	2019-12-20 21:09:00 +01:00
Daniel Eklöf	d325ae10ee	vt: convert CSI param from table lookup to switch	2019-12-20 21:04:47 +01:00
Daniel Eklöf	23a6c6b711	vt: add missing 'entry' actions to 'anywhere' sections	2019-12-20 20:58:02 +01:00
Daniel Eklöf	b1fd960b4b	vt: convert CSI entry from table lookup to switch	2019-12-20 20:57:38 +01:00
Daniel Eklöf	a5f238b388	vt: re-align switches	2019-12-20 20:43:31 +01:00
Daniel Eklöf	b2f091d243	vt: replace GROUND, ESCAPE and ESCAPE_INTERMEDIATE tables with switches	2019-12-20 19:16:52 +01:00
Daniel Eklöf	56faca4266	vt: use a switch instead of a top-level state lookup table Remove the top-level state lookup table, which mapped from a state enum to a state transition table, and replace it with a switch.	2019-12-20 18:24:32 +01:00
Daniel Eklöf	2c4af8728d	vt: add commented out cases for 8-bit C1 control characters XTerm seems to ignore these when in UTF-8 mode. Since we _only_ support UTF-8, we don't need to recognize these control characters at all. However, it may be good to have them here for reference. So add them, but commented out, along with their corresponding 7-bit versions (which we _do_ recognize and implement).	2019-12-14 20:28:05 +01:00
Daniel Eklöf	0e5a69d869	vt: don't try to move cursor outside the terminal When we insert an auto-newline, we must make sure we don't try to move outside the terminal window. This can for example happen when a scrolling region have been configured, and the cursor is outside the scrolling region (i.e. it's in the bottom margin).	2019-11-30 00:32:34 +01:00
Daniel Eklöf	88c1a8939f	vt: fix memory corruption: wcwidth() may return -1 When it did, we called print_insert() with that, which in turn resulted in a too large size value passed to memmove.	2019-11-30 00:15:05 +01:00
Daniel Eklöf	9551be492c	csi/vt: don't bad client data as errors This gets rid of spam when cat:ing binary data.	2019-11-30 00:12:30 +01:00
Daniel Eklöf	66f941d00a	vt: only define esc_as_string() when debug logging has been enabled	2019-11-30 00:02:19 +01:00
Daniel Eklöf	cd9510aa7b	vt: disable logging BELL	2019-11-30 00:00:41 +01:00
Daniel Eklöf	616896e2a5	csi/ocs/vt: log unhandled/unrecognized sequences as debug messages Having them as error messages was nice when we where still missing lots of sequences. Now we don't anymore, and these just spam stdout as well as syslog when e.g. cat:ing binary data.	2019-11-29 23:59:24 +01:00
Daniel Eklöf	3026b8981a	vt: there are actually many state transitions that are no-ops In most states, most 8-bit values are no-ops. This is already handled; action() recognizes ACTION_NONE as a no-op. Thus, all we need to do is remove the assertion.	2019-11-29 23:38:01 +01:00

1 2 3 4 5 ...

262 commits