aboutsummaryrefslogtreecommitdiff
path: root/doc/changes.html
diff options
context:
space:
mode:
Diffstat (limited to 'doc/changes.html')
-rw-r--r--doc/changes.html882
1 files changed, 0 insertions, 882 deletions
diff --git a/doc/changes.html b/doc/changes.html
deleted file mode 100644
index 5fc74f10..00000000
--- a/doc/changes.html
+++ /dev/null
@@ -1,882 +0,0 @@
1<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
2<html>
3<head>
4<title>LuaJIT Change History</title>
5<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
6<meta name="Copyright" content="Copyright (C) 2005-2020">
7<meta name="Language" content="en">
8<link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
9<link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
10<style type="text/css">
11div.major { max-width: 600px; padding: 1em; margin: 1em 0 1em 0; }
12</style>
13</head>
14<body>
15<div id="site">
16<a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
17</div>
18<div id="head">
19<h1>LuaJIT Change History</h1>
20</div>
21<div id="nav">
22<ul><li>
23<a href="luajit.html">LuaJIT</a>
24<ul><li>
25<a href="http://luajit.org/download.html">Download <span class="ext">&raquo;</span></a>
26</li><li>
27<a href="install.html">Installation</a>
28</li><li>
29<a href="running.html">Running</a>
30</li></ul>
31</li><li>
32<a href="extensions.html">Extensions</a>
33<ul><li>
34<a href="ext_ffi.html">FFI Library</a>
35<ul><li>
36<a href="ext_ffi_tutorial.html">FFI Tutorial</a>
37</li><li>
38<a href="ext_ffi_api.html">ffi.* API</a>
39</li><li>
40<a href="ext_ffi_semantics.html">FFI Semantics</a>
41</li></ul>
42</li><li>
43<a href="ext_jit.html">jit.* Library</a>
44</li><li>
45<a href="ext_c_api.html">Lua/C API</a>
46</li><li>
47<a href="ext_profiler.html">Profiler</a>
48</li></ul>
49</li><li>
50<a href="status.html">Status</a>
51<ul><li>
52<a class="current" href="changes.html">Changes</a>
53</li></ul>
54</li><li>
55<a href="faq.html">FAQ</a>
56</li><li>
57<a href="http://luajit.org/performance.html">Performance <span class="ext">&raquo;</span></a>
58</li><li>
59<a href="http://wiki.luajit.org/">Wiki <span class="ext">&raquo;</span></a>
60</li><li>
61<a href="http://luajit.org/list.html">Mailing List <span class="ext">&raquo;</span></a>
62</li></ul>
63</div>
64<div id="main">
65<p>
66This is a list of changes between the released versions of LuaJIT.<br>
67The current <span style="color: #0000c0;">stable version</span> is <strong>LuaJIT&nbsp;2.0.5</strong>.<br>
68</p>
69<p>
70Please check the
71<a href="http://luajit.org/changes.html"><span class="ext">&raquo;</span>&nbsp;Online Change History</a>
72to see whether newer versions are available.
73</p>
74
75<div class="major" style="background: #d0d0ff;">
76<h2 id="LuaJIT-2.1.0-beta3">LuaJIT 2.1.0-beta3 &mdash; 2017-05-01</h2>
77<ul>
78<li>Rewrite memory block allocator.</li>
79<li>Add various extension from Lua 5.2/5.3.</li>
80<li>Remove old Lua 5.0 compatibility defines.</li>
81<li>Set arg table before evaluating <tt>LUA_INIT</tt> and <tt>-e</tt> chunks.</li>
82<li>Fix FOLD rules for <tt>math.abs()</tt> and FP negation.</li>
83<li>Fix soft-float <tt>math.abs()</tt> and negation.</li>
84<li>Fix formatting of some small denormals at low precision.</li>
85<li>LJ_GC64: Add JIT compiler support.</li>
86<li>x64/LJ_GC64: Add JIT compiler backend.</li>
87<li>x86/x64: Generate BMI2 shifts and rotates, if available.</li>
88<li>Windows/x86: Add full exception interoperability.</li>
89<li>ARM64: Add big-endian support.</li>
90<li>ARM64: Add JIT compiler backend.</li>
91<li>MIPS: Fix <tt>TSETR</tt> barrier.</li>
92<li>MIPS: Support MIPS16 interlinking.</li>
93<li>MIPS soft-float: Fix code generation for <tt>HREF</tt>.</li>
94<li>MIPS64: Add MIPS64 hard-float JIT compiler backend.</li>
95<li>MIPS64: Add MIPS64 hard-float/soft-float support to interpreter.</li>
96<li>FFI: Compile bitfield loads/stores.</li>
97<li>Various fixes common with the 2.0 branch.</li>
98</ul>
99
100<h2 id="LuaJIT-2.1.0-beta2">LuaJIT 2.1.0-beta2 &mdash; 2016-03-03</h2>
101<ul>
102<li>Enable trace stitching.</li>
103<li>Use internal implementation for converting FP numbers to strings.</li>
104<li>Parse Unicode escape <tt>'\u{XX...}'</tt> in string literals.</li>
105<li>Add MIPS soft-float support.</li>
106<li>Switch MIPS port to dual-number mode.</li>
107<li>x86/x64: Add support for AES-NI, AVX and AVX2 to DynASM.</li>
108<li>FFI: Add <tt>ssize_t</tt> declaration.</li>
109<li>FFI: Parse <tt>#line NN</tt> and <tt>#NN</tt>.</li>
110<li>Various minor fixes.</li>
111</ul>
112
113<h2 id="LuaJIT-2.1.0-beta1">LuaJIT 2.1.0-beta1 &mdash; 2015-08-25</h2>
114<p>
115This is a brief summary of the major changes in LuaJIT 2.1 compared to 2.0.
116Please take a look at the commit history for more details.
117</p>
118<ul>
119<li>Changes to the VM core:
120<ul>
121<li>Add low-overhead profiler (<tt>-jp</tt>).</li>
122<li>Add <tt>LJ_GC64</tt> mode: 64 bit GC object references (really: 47 bit). Interpreter-only for now.</li>
123<li>Add <tt>LJ_FR2</tt> mode: Two-slot frame info. Required by <tt>LJ_GC64</tt> mode.</li>
124<li>Add <tt>table.new()</tt> and <tt>table.clear()</tt>.</li>
125<li>Parse binary number literals (<tt>0bxxx</tt>).</li>
126</ul></li>
127<li>Improvements to the JIT compiler:
128<ul>
129<li>Add trace stitching (disabled for now).</li>
130<li>Compile various builtins: <tt>string.char()</tt>, <tt>string.reverse()</tt>, <tt>string.lower()</tt>, <tt>string.upper()</tt>, <tt>string.rep()</tt>, <tt>string.format()</tt>, <tt>table.concat()</tt>, <tt>bit.tohex()</tt>, <tt>getfenv(0)</tt>, <tt>debug.getmetatable()</tt>.</li>
131<li>Compile <tt>string.find()</tt> for fixed string searches (no patterns).</li>
132<li>Compile <tt>BC_TSETM</tt>, e.g. <tt>{1,2,3,f()}</tt>.</li>
133<li>Compile string concatenations (<tt>BC_CAT</tt>).</li>
134<li>Compile <tt>__concat</tt> metamethod.</li>
135<li>Various minor optimizations.</li>
136</ul></li>
137<li>Internal Changes:
138<ul>
139<li>Add support for embedding LuaJIT bytecode for builtins.</li>
140<li>Replace various builtins with embedded bytecode.</li>
141<li>Refactor string buffers and string formatting.</li>
142<li>Remove obsolete non-truncating number to integer conversions.</li>
143</ul></li>
144<li>Ports:
145<ul>
146<li>Add Xbox One port (<tt>LJ_GC64</tt> mode).</li>
147<li>ARM64: Add port of the interpreter (<tt>LJ_GC64</tt> mode).</li>
148<li>x64: Add separate port of the interpreter to <tt>LJ_GC64</tt> mode.</li>
149<li>x86/x64: Drop internal x87 math functions. Use libm functions.</li>
150<li>x86: Remove x87 support from interpreter. SSE2 is mandatory now.</li>
151<li>PPC/e500: Drop support for this architecture.</li>
152</ul></li>
153<li>FFI library:
154<ul>
155<li>FFI: Add 64 bit bitwise operations.</li>
156<li>FFI: Compile VLA/VLS and large cdata allocations with default initialization.</li>
157<li>FFI: Compile conversions from functions to function pointers.</li>
158<li>FFI: Compile lightuserdata to <tt>void *</tt> conversion.</li>
159<li>FFI: Compile <tt>ffi.gc(cdata, nil)</tt>, too.</li>
160<li>FFI: Add <tt>ffi.typeinfo()</tt>.</li>
161</ul></li>
162</ul>
163</div>
164
165<div class="major" style="background: #ffffd0;">
166<h2 id="LuaJIT-2.0.5">LuaJIT 2.0.5 &mdash; 2017-05-01</h2>
167<ul>
168<li>Add workaround for MSVC 2015 stdio changes.</li>
169<li>Limit mcode alloc probing, depending on the available pool size.</li>
170<li>Fix overly restrictive range calculation in mcode allocation.</li>
171<li>Fix out-of-scope goto handling in parser.</li>
172<li>Remove internal <tt>__mode = "K"</tt> and replace with safe check.</li>
173<li>Add "proto" field to <tt>jit.util.funcinfo()</tt>.</li>
174<li>Fix GC step size calculation.</li>
175<li>Initialize <tt>uv-&gt;immutable</tt> for upvalues of loaded chunks.</li>
176<li>Fix for cdata vs. non-cdata arithmetics/comparisons.</li>
177<li>Drop leftover regs in 'for' iterator assignment, too.</li>
178<li>Fix PHI remarking in SINK pass.</li>
179<li>Don't try to record outermost <tt>pcall()</tt> return to lower frame.</li>
180<li>Add guard for obscure aliasing between open upvalues and SSA slots.</li>
181<li>Remove assumption that <tt>lj_math_random_step()</tt> doesn't clobber FPRs.</li>
182<li>Fix handling of non-numeric strings in arithmetic coercions.</li>
183<li>Fix recording of <tt>select(n, ...)</tt> with off-trace varargs</li>
184<li>Fix install for cross-builds.</li>
185<li>Don't allocate unused 2nd result register in JIT compiler backend.</li>
186<li>Drop marks from replayed instructions when sinking.</li>
187<li>Fix unsinking check.</li>
188<li>Properly handle OOM in <tt>trace_save()</tt>.</li>
189<li>Limit number of arguments given to <tt>io.lines()</tt> and <tt>fp:lines()</tt>.</li>
190<li>Fix narrowing of <tt>TOBIT</tt>.</li>
191<li>OSX: Fix build with recent XCode.</li>
192<li>x86/x64: Don't spill an explicit <tt>REF_BASE</tt> in the IR.</li>
193<li>x86/x64: Fix instruction length decoder.</li>
194<li>x86/x64: Search for exit jumps with instruction length decoder.</li>
195<li>ARM: Fix <tt>BLX</tt> encoding for Thumb interworking calls.</li>
196<li>MIPS: Don't use <tt>RID_GP</tt> as a scratch register.</li>
197<li>MIPS: Fix emitted code for U32 to float conversion.</li>
198<li>MIPS: Backport workaround for compact unwind tables.</li>
199<li>MIPS: Fix cross-endian jit.bcsave.</li>
200<li>MIPS: Fix <tt>BC_ISNEXT</tt> fallback path.</li>
201<li>MIPS: Fix use of ffgccheck delay slots in interpreter.</li>
202<li>FFI: Fix FOLD rules for <tt>int64_t</tt> comparisons.</li>
203<li>FFI: Fix SPLIT pass for <tt>CONV i64.u64</tt>.</li>
204<li>FFI: Fix <tt>ipairs()</tt> recording.</li>
205<li>FFI: Don't propagate qualifiers into subtypes of complex.</li>
206</ul>
207
208<h2 id="LuaJIT-2.0.4">LuaJIT 2.0.4 &mdash; 2015-05-14</h2>
209<ul>
210<li>Fix stack check in narrowing optimization.</li>
211<li>Fix Lua/C API typecheck error for special indexes.</li>
212<li>Fix string to number conversion.</li>
213<li>Fix lexer error for chunks without tokens.</li>
214<li>Don't compile <tt>IR_RETF</tt> after <tt>CALLT</tt> to ff with-side effects.</li>
215<li>Fix <tt>BC_UCLO</tt>/<tt>BC_JMP</tt> join optimization in Lua parser.</li>
216<li>Fix corner case in string to number conversion.</li>
217<li>Gracefully handle <tt>lua_error()</tt> for a suspended coroutine.</li>
218<li>Avoid error messages when building with Clang.</li>
219<li>Fix snapshot #0 handling for traces with a stack check on entry.</li>
220<li>Fix fused constant loads under high register pressure.</li>
221<li>Invalidate backpropagation cache after DCE.</li>
222<li>Fix ABC elimination.</li>
223<li>Fix debug info for main chunk of stripped bytecode.</li>
224<li>Fix FOLD rule for <tt>string.sub(s, ...) == k</tt>.</li>
225<li>Fix FOLD rule for <tt>STRREF</tt> of <tt>SNEW</tt>.</li>
226<li>Fix frame traversal while searching for error function.</li>
227<li>Prevent GC estimate miscalculation due to buffer growth.</li>
228<li>Prevent adding side traces for stack checks.</li>
229<li>Fix top slot calculation for snapshots with continuations.</li>
230<li>Fix check for reuse of SCEV results in <tt>FORL</tt>.</li>
231<li>Add PS Vita port.</li>
232<li>Fix compatibility issues with Illumos.</li>
233<li>Fix DragonFly build (unsupported).</li>
234<li>OpenBSD/x86: Better executable memory allocation for W^X mode.</li>
235<li>x86: Fix argument checks for <tt>ipairs()</tt> iterator.</li>
236<li>x86: <tt>lj_math_random_step()</tt> clobbers XMM regs on OSX Clang.</li>
237<li>x86: Fix code generation for unused result of <tt>math.random()</tt>.</li>
238<li>x64: Allow building with <tt>LUAJIT_USE_SYSMALLOC</tt> and <tt>LUAJIT_USE_VALGRIND</tt>.</li>
239<li>x86/x64: Fix argument check for bit shifts.</li>
240<li>x86/x64: Fix code generation for fused test/arith ops.</li>
241<li>ARM: Fix write barrier check in <tt>BC_USETS</tt>.</li>
242<li>PPC: Fix red zone overflow in machine code generation.</li>
243<li>PPC: Don't use <tt>mcrxr</tt> on PPE.</li>
244<li>Various archs: Fix excess stack growth in interpreter.</li>
245<li>FFI: Fix FOLD rule for <tt>TOBIT</tt> + <tt>CONV num.u32</tt>.</li>
246<li>FFI: Prevent DSE across <tt>ffi.string()</tt>.</li>
247<li>FFI: No meta fallback when indexing pointer to incomplete struct.</li>
248<li>FFI: Fix initialization of unions of subtypes.</li>
249<li>FFI: Fix cdata vs. non-cdata arithmetic and comparisons.</li>
250<li>FFI: Fix <tt>__index</tt>/<tt>__newindex</tt> metamethod resolution for ctypes.</li>
251<li>FFI: Fix compilation of reference field access.</li>
252<li>FFI: Fix frame traversal for backtraces with FFI callbacks.</li>
253<li>FFI: Fix recording of indexing a struct pointer ctype object itself.</li>
254<li>FFI: Allow non-scalar cdata to be compared for equality by address.</li>
255<li>FFI: Fix pseudo type conversions for type punning.</li>
256</ul>
257
258<h2 id="LuaJIT-2.0.3">LuaJIT 2.0.3 &mdash; 2014-03-12</h2>
259<ul>
260<li>Add PS4 port.</li>
261<li>Add support for multilib distro builds.</li>
262<li>Fix OSX build.</li>
263<li>Fix MinGW build.</li>
264<li>Fix Xbox 360 build.</li>
265<li>Improve ULOAD forwarding for open upvalues.</li>
266<li>Fix GC steps threshold handling when called by JIT-compiled code.</li>
267<li>Fix argument checks for <tt>math.deg()</tt> and <tt>math.rad()</tt>.</li>
268<li>Fix <tt>jit.flush(func|true)</tt>.</li>
269<li>Respect <tt>jit.off(func)</tt> when returning to a function, too.</li>
270<li>Fix compilation of <tt>string.byte(s, nil, n)</tt>.</li>
271<li>Fix line number for relocated bytecode after closure fixup</li>
272<li>Fix frame traversal for backtraces.</li>
273<li>Fix ABC elimination.</li>
274<li>Fix handling of redundant PHIs.</li>
275<li>Fix snapshot restore for exit to function header.</li>
276<li>Fix type punning alias analysis for constified pointers</li>
277<li>Fix call unroll checks in the presence of metamethod frames.</li>
278<li>Fix initial maxslot for down-recursive traces.</li>
279<li>Prevent BASE register coalescing if parent uses <tt>IR_RETF</tt>.</li>
280<li>Don't purge modified function from stack slots in <tt>BC_RET</tt>.</li>
281<li>Fix recording of <tt>BC_VARG</tt>.</li>
282<li>Don't access dangling reference to reallocated IR.</li>
283<li>Fix frame depth display for bytecode dump in <tt>-jdump</tt>.</li>
284<li>ARM: Fix register allocation when rematerializing FPRs.</li>
285<li>x64: Fix store to upvalue for lightuserdata values.</li>
286<li>FFI: Add missing GC steps for callback argument conversions.</li>
287<li>FFI: Properly unload loaded DLLs.</li>
288<li>FFI: Fix argument checks for <tt>ffi.string()</tt>.</li>
289<li>FFI/x64: Fix passing of vector arguments to calls.</li>
290<li>FFI: Rehash finalizer table after GC cycle, if needed.</li>
291<li>FFI: Fix <tt>cts-&gt;L</tt> for cdata unsinking in snapshot restore.</li>
292</ul>
293
294<h2 id="LuaJIT-2.0.2">LuaJIT 2.0.2 &mdash; 2013-06-03</h2>
295<ul>
296<li>Fix memory access check for fast string interning.</li>
297<li>Fix MSVC intrinsics for older versions.</li>
298<li>Add missing GC steps for <tt>io.*</tt> functions.</li>
299<li>Fix spurious red zone overflows in machine code generation.</li>
300<li>Fix jump-range constrained mcode allocation.</li>
301<li>Inhibit DSE for implicit loads via calls.</li>
302<li>Fix builtin string to number conversion for overflow digits.</li>
303<li>Fix optional argument handling while recording builtins.</li>
304<li>Fix optional argument handling in <tt>table.concat()</tt>.</li>
305<li>Add partial support for building with MingW64 GCC 4.8-SEH.</li>
306<li>Add missing PHI barrier to <tt>string.sub(str, a, b) == kstr</tt> FOLD rule.</li>
307<li>Fix compatibility issues with Illumos.</li>
308<li>ARM: Fix cache flush/sync for exit stubs of JIT-compiled code.</li>
309<li>MIPS: Fix cache flush/sync for JIT-compiled code jump area.</li>
310<li>PPC: Add <tt>plt</tt> suffix for external calls from assembler code.</li>
311<li>FFI: Fix snapshot substitution in SPLIT pass.</li>
312<li>FFI/x86: Fix register allocation for 64 bit comparisons.</li>
313<li>FFI: Fix tailcall in lowest frame to C&nbsp;function with bool result.</li>
314<li>FFI: Ignore <tt>long</tt> type specifier in <tt>ffi.istype()</tt>.</li>
315<li>FFI: Fix calling conventions for 32 bit OSX and iOS simulator (struct returns).</li>
316<li>FFI: Fix calling conventions for ARM hard-float EABI (nested structs).</li>
317<li>FFI: Improve error messages for arithmetic and comparison operators.</li>
318<li>FFI: Insert no-op type conversion for pointer to integer cast.</li>
319<li>FFI: Fix unroll limit for <tt>ffi.fill()</tt>.</li>
320<li>FFI: Must sink <tt>XBAR</tt> together with <tt>XSTORE</tt>s.</li>
321<li>FFI: Preserve intermediate string for <tt>const&nbsp;char&nbsp;*</tt> conversion.</li>
322</ul>
323
324<h2 id="LuaJIT-2.0.1">LuaJIT 2.0.1 &mdash; 2013-02-19</h2>
325<ul>
326<li>Don't clear frame for out-of-memory error.</li>
327<li>Leave hook when resume catches error thrown from hook.</li>
328<li>Add missing GC steps for template table creation.</li>
329<li>Fix discharge order of comparisons in Lua parser.</li>
330<li>Improve buffer handling for <tt>io.read()</tt>.</li>
331<li>OSX: Add support for Mach-O object files to <tt>-b</tt> option.</li>
332<li>Fix PS3 port.</li>
333<li>Fix/enable Xbox 360 port.</li>
334<li>x86/x64: Always mark ref for shift count as non-weak.</li>
335<li>x64: Don't fuse implicitly 32-to-64 extended operands.</li>
336<li>ARM: Fix armhf call argument handling.</li>
337<li>ARM: Fix code generation for integer math.min/math.max.</li>
338<li>PPC/e500: Fix <tt>lj_vm_floor()</tt> for Inf/NaN.</li>
339<li>FFI: Change priority of table initializer variants for structs.</li>
340<li>FFI: Fix code generation for bool call result check on x86/x64.</li>
341<li>FFI: Load FFI library on-demand for bytecode with cdata literals.</li>
342<li>FFI: Fix handling of qualified transparent structs/unions.</li>
343</ul>
344
345<h2 id="LuaJIT-2.0.0">LuaJIT 2.0.0 &mdash; 2012-11-08</h2>
346<ul>
347<li>Correctness and completeness:
348<ul>
349 <li>Fix Android/x86 build.</li>
350 <li>Fix recording of equality comparisons with <tt>__eq</tt> metamethods.</li>
351 <li>Fix detection of immutable upvalues.</li>
352 <li>Replace error with PANIC for callbacks from JIT-compiled code.</li>
353 <li>Fix builtin string to number conversion for <tt>INT_MIN</tt>.</li>
354 <li>Don't create unneeded array part for template tables.</li>
355 <li>Fix <tt>CONV.num.int</tt> sinking.</li>
356 <li>Don't propagate implicitly widened number to index metamethods.</li>
357 <li>ARM: Fix ordered comparisons of number vs. non-number.</li>
358 <li>FFI: Fix code generation for replay of sunk float fields.</li>
359 <li>FFI: Fix signedness of bool.</li>
360 <li>FFI: Fix recording of bool call result check on x86/x64.</li>
361 <li>FFI: Fix stack-adjustment for <tt>__thiscall</tt> callbacks.</li>
362</ul></li>
363</ul>
364
365<h2 id="LuaJIT-2.0.0-beta11">LuaJIT 2.0.0-beta11 &mdash; 2012-10-16</h2>
366<ul>
367<li>New features:
368<ul>
369 <li>Use ARM VFP instructions, if available (build-time detection).</li>
370 <li>Add support for ARM hard-float EABI (<tt>armhf</tt>).</li>
371 <li>Add PS3 port.</li>
372 <li>Add many features from Lua&nbsp;5.2, e.g. <tt>goto</tt>/labels.
373 Refer to <a href="extensions.html#lua52">this list</a>.</li>
374 <li>FFI: Add parameterized C types.</li>
375 <li>FFI: Add support for copy constructors.</li>
376 <li>FFI: Equality comparisons never raise an error (treat as unequal instead).</li>
377 <li>FFI: Box all accessed or returned enums.</li>
378 <li>FFI: Check for <tt>__new</tt> metamethod when calling a constructor.</li>
379 <li>FFI: Handle <tt>__pairs</tt>/<tt>__ipairs</tt> metamethods for cdata objects.</li>
380 <li>FFI: Convert <tt>io.*</tt> file handle to <tt>FILE *</tt> pointer (but as a <tt>void *</tt>).</li>
381 <li>FFI: Detect and support type punning through unions.</li>
382 <li>FFI: Improve various error messages.</li>
383</ul></li>
384<li>Build-system reorganization:
385<ul>
386 <li>Reorganize directory layout:<br>
387 <tt>lib/*</tt> &rarr; <tt>src/jit/*</tt><br>
388 <tt>src/buildvm_*.dasc</tt> &rarr; <tt>src/vm_*.dasc</tt><br>
389 <tt>src/buildvm_*.h</tt> &rarr; removed<br>
390 <tt>src/buildvm*</tt> &rarr; <tt>src/host/*</tt></li>
391 <li>Add minified Lua interpreter plus Lua BitOp (<tt>minilua</tt>) to run DynASM.</li>
392 <li>Change DynASM bit operations to use Lua BitOp</li>
393 <li>Translate only <tt>vm_*.dasc</tt> for detected target architecture.</li>
394 <li>Improve target detection for <tt>msvcbuild.bat</tt>.</li>
395 <li>Fix build issues on Cygwin and MinGW with optional MSys.</li>
396 <li>Handle cross-compiles with FPU/no-FPU or hard-fp/soft-fp ABI mismatch.</li>
397 <li>Remove some library functions for no-JIT/no-FFI builds.</li>
398 <li>Add uninstall target to top-level Makefile.</li>
399</ul></li>
400<li>Correctness and completeness:
401<ul>
402 <li>Preserve snapshot #0 PC for all traces.</li>
403 <li>Fix argument checks for <tt>coroutine.create()</tt>.</li>
404 <li>Command line prints version and JIT status to <tt>stdout</tt>, not <tt>stderr</tt>.</li>
405 <li>Fix userdata <tt>__gc</tt> separations at Lua state close.</li>
406 <li>Fix <tt>TDUP</tt> to <tt>HLOAD</tt> forwarding for <tt>LJ_DUALNUM</tt> builds.</li>
407 <li>Fix buffer check in bytecode writer.</li>
408 <li>Make <tt>os.date()</tt> thread-safe.</li>
409 <li>Add missing declarations for MSVC intrinsics.</li>
410 <li>Fix dispatch table modifications for return hooks.</li>
411 <li>Workaround for MSVC conversion bug (<tt>double</tt> &rarr; <tt>uint32_t</tt> &rarr; <tt>int32_t</tt>).</li>
412 <li>Fix FOLD rule <tt>(i-j)-i => 0-j</tt>.</li>
413 <li>Never use DWARF unwinder on Windows.</li>
414 <li>Fix shrinking of direct mapped blocks in builtin allocator.</li>
415 <li>Limit recursion depth in <tt>string.match()</tt> et al.</li>
416 <li>Fix late despecialization of <tt>ITERN</tt> after loop has been entered.</li>
417 <li>Fix <tt>'f'</tt> and <tt>'L'</tt> options for <tt>debug.getinfo()</tt> and <tt>lua_getinfo()</tt>.</li>
418 <li>Fix <tt>package.searchpath()</tt>.</li>
419 <li>OSX: Change dylib names to be consistent with other platforms.</li>
420 <li>Android: Workaround for broken <tt>sprintf("%g",&nbsp;-0.0)</tt>.</li>
421 <li>x86: Remove support for ancient CPUs without <tt>CMOV</tt> (before Pentium Pro).</li>
422 <li>x86: Fix register allocation for calls returning register pair.</li>
423 <li>x86/x64: Fix fusion of unsigned byte comparisons with swapped operands.</li>
424 <li>ARM: Fix <tt>tonumber()</tt> argument check.</li>
425 <li>ARM: Fix modulo operator and <tt>math.floor()</tt>/<tt>math.ceil()</tt> for <tt>inf</tt>/<tt>nan</tt>.</li>
426 <li>ARM: Invoke SPLIT pass for leftover <tt>IR_TOBIT</tt>.</li>
427 <li>ARM: Fix BASE register coalescing.</li>
428 <li>PPC: Fix interpreter state setup in callbacks.</li>
429 <li>PPC: Fix <tt>string.sub()</tt> range check.</li>
430 <li>MIPS: Support generation of MIPS/MIPSEL bytecode object files.</li>
431 <li>MIPS: Fix calls to <tt>floor()</tt>/<tt>ceil()</tt><tt>/trunc()</tt>.</li>
432 <li>ARM/PPC: Detect more target architecture variants.</li>
433 <li>ARM/PPC/e500/MIPS: Fix tailcalls from fast functions, esp. <tt>tostring()</tt>.</li>
434 <li>ARM/PPC/MIPS: Fix rematerialization of FP constants.</li>
435 <li>FFI: Don't call <tt>FreeLibrary()</tt> on our own EXE/DLL.</li>
436 <li>FFI: Resolve metamethods for constructors, too.</li>
437 <li>FFI: Properly disable callbacks on iOS (would require executable memory).</li>
438 <li>FFI: Fix cdecl string parsing during recording.</li>
439 <li>FFI: Show address pointed to for <tt>tostring(ref)</tt>, too.</li>
440 <li>FFI: Fix alignment of C call argument/return structure.</li>
441 <li>FFI: Initialize all fields of standard types.</li>
442 <li>FFI: Fix callback handling when new C&nbsp;types are declared in callback.</li>
443 <li>FFI: Fix recording of constructors for pointers.</li>
444 <li>FFI: Always resolve metamethods for pointers to structs.</li>
445 <li>FFI: Correctly propagate alignment when interning nested types.</li>
446</ul></li>
447<li>Structural and performance enhancements:
448<ul>
449 <li>Add allocation sinking and store sinking optimization.</li>
450 <li>Constify immutable upvalues.</li>
451 <li>Add builtin string to integer or FP number conversion. Improves cross-platform consistency and correctness.</li>
452 <li>Create string hash slots in template tables for non-const values, too. Avoids later table resizes.</li>
453 <li>Eliminate <tt>HREFK</tt> guard for template table references.</li>
454 <li>Add various new FOLD rules.</li>
455 <li>Don't use stack unwinding for <tt>lua_yield()</tt> (slow on x64).</li>
456 <li>ARM, PPC, MIPS: Improve <tt>XLOAD</tt> operand fusion and register hinting.</li>
457 <li>PPC, MIPS: Compile <tt>math.sqrt()</tt> to sqrt instruction, if available.</li>
458 <li>FFI: Fold <tt>KPTR</tt> + constant offset in SPLIT pass.</li>
459 <li>FFI: Optimize/inline <tt>ffi.copy()</tt> and <tt>ffi.fill()</tt>.</li>
460 <li>FFI: Compile and optimize array/struct copies.</li>
461 <li>FFI: Compile <tt>ffi.typeof(cdata|ctype)</tt>, <tt>ffi.sizeof()</tt>, <tt>ffi.alignof()</tt>, <tt>ffi.offsetof()</tt> and <tt>ffi.gc()</tt>.</li>
462</ul></li>
463</ul>
464
465<h2 id="LuaJIT-2.0.0-beta10">LuaJIT 2.0.0-beta10 &mdash; 2012-05-09</h2>
466<ul>
467<li>New features:
468<ul>
469<li>The MIPS of LuaJIT is complete. It requires a CPU conforming to the
470MIPS32&nbsp;R1 architecture with hardware FPU. O32 hard-fp ABI,
471little-endian or big-endian.</li>
472<li>Auto-detect target arch via cross-compiler. No need for
473<tt>TARGET=arch</tt> anymore.</li>
474<li>Make DynASM compatible with Lua 5.2.</li>
475<li>From Lua 5.2: Try <tt>__tostring</tt> metamethod on non-string error
476messages..</li>
477</ul></li>
478<li>Correctness and completeness:
479<ul>
480<li>Fix parsing of hex literals with exponents.</li>
481<li>Fix bytecode dump for certain number constants.</li>
482<li>Fix argument type in error message for relative arguments.</li>
483<li>Fix argument error handling on Lua stacks without a frame.</li>
484<li>Add missing mcode limit check in assembler backend.</li>
485<li>Fix compilation on OpenBSD.</li>
486<li>Avoid recursive GC steps after GC-triggered trace exit.</li>
487<li>Replace <tt>&lt;unwind.h&gt;</tt> definitions with our own.</li>
488<li>Fix OSX build issues. Bump minimum required OSX version to 10.4.</li>
489<li>Fix discharge order of comparisons in Lua parser.</li>
490<li>Ensure running <tt>__gc</tt> of userdata created in <tt>__gc</tt>
491at state close.</li>
492<li>Limit number of userdata <tt>__gc</tt> separations at state close.</li>
493<li>Fix bytecode <tt>JMP</tt> slot range when optimizing
494<tt>and</tt>/<tt>or</tt> with constant LHS.</li>
495<li>Fix DSE of <tt>USTORE</tt>.</li>
496<li>Make <tt>lua_concat()</tt> work from C&nbsp;hook with partial frame.</li>
497<li>Add required PHIs for implicit conversions, e.g. via <tt>XREF</tt>
498forwarding.</li>
499<li>Add more comparison variants to Valgrind suppressions file.</li>
500<li>Disable loading bytecode with an extra header (BOM or <tt>#!</tt>).</li>
501<li>Fix PHI stack slot syncing.</li>
502<li>ARM: Reorder type/value tests to silence Valgrind.</li>
503<li>ARM: Fix register allocation for <tt>ldrd</tt>-optimized
504<tt>HREFK</tt>.</li>
505<li>ARM: Fix conditional branch fixup for <tt>OBAR</tt>.</li>
506<li>ARM: Invoke SPLIT pass for <tt>double</tt> args in FFI call.</li>
507<li>ARM: Handle all <tt>CALL*</tt> ops with <tt>double</tt> results in
508SPLIT pass.</li>
509<li>ARM: Fix rejoin of <tt>POW</tt> in SPLIT pass.</li>
510<li>ARM: Fix compilation of <tt>math.sinh</tt>, <tt>math.cosh</tt>,
511<tt>math.tanh</tt>.</li>
512<li>ARM, PPC: Avoid pointless arg clearing in <tt>BC_IFUNCF</tt>.</li>
513<li>PPC: Fix resume after yield from hook.</li>
514<li>PPC: Fix argument checking for <tt>rawget()</tt>.</li>
515<li>PPC: Fix fusion of floating-point <tt>XLOAD</tt>/<tt>XSTORE</tt>.</li>
516<li>PPC: Fix <tt>HREFK</tt> code generation for huge tables.</li>
517<li>PPC: Use builtin D-Cache/I-Cache sync code.</li>
518</ul></li>
519<li>FFI library:
520<ul>
521<li>Ignore empty statements in <tt>ffi.cdef()</tt>.</li>
522<li>Ignore number parsing errors while skipping definitions.</li>
523<li>Don't touch frame in callbacks with tailcalls to fast functions.</li>
524<li>Fix library unloading on POSIX systems.</li>
525<li>Finalize cdata before userdata when closing the state.</li>
526<li>Change <tt>ffi.load()</tt> library name resolution for Cygwin.</li>
527<li>Fix resolving of function name redirects on Windows/x86.</li>
528<li>Fix symbol resolving error messages on Windows.</li>
529<li>Fix blacklisting of C functions calling callbacks.</li>
530<li>Fix result type of pointer difference.</li>
531<li>Use correct PC in FFI metamethod error message.</li>
532<li>Allow <tt>'typedef _Bool int BOOL;'</tt> for the Windows API.</li>
533<li>Don't record test for bool result of call, if ignored.</li>
534</ul></li>
535</ul>
536
537<h2 id="LuaJIT-2.0.0-beta9">LuaJIT 2.0.0-beta9 &mdash; 2011-12-14</h2>
538<ul>
539<li>New features:
540<ul>
541<li>PPC port of LuaJIT is complete. Default is the dual-number port
542(usually faster). Single-number port selectable via <tt>src/Makefile</tt>
543at build time.</li>
544<li>Add FFI callback support.</li>
545<li>Extend <tt>-b</tt> to generate <tt>.c</tt>, <tt>.h</tt> or <tt>.obj/.o</tt>
546files with embedded bytecode.</li>
547<li>Allow loading embedded bytecode with <tt>require()</tt>.</li>
548<li>From Lua 5.2: Change to <tt>'\z'</tt> escape. Reject undefined escape
549sequences.</li>
550</ul></li>
551<li>Correctness and completeness:
552<ul>
553<li>Fix OSX 10.7 build. Fix <tt>install_name</tt> and versioning on OSX.</li>
554<li>Fix iOS build.</li>
555<li>Install <tt>dis_arm.lua</tt>, too.</li>
556<li>Mark installed shared library as executable.</li>
557<li>Add debug option to <tt>msvcbuild.bat</tt> and improve error handling.</li>
558<li>Fix data-flow analysis for iterators.</li>
559<li>Fix forced unwinding triggered by external unwinder.</li>
560<li>Record missing <tt>for</tt> loop slot loads (return to lower frame).</li>
561<li>Always use ANSI variants of Windows system functions.</li>
562<li>Fix GC barrier for multi-result table constructor (<tt>TSETM</tt>).</li>
563<li>Fix/add various FOLD rules.</li>
564<li>Add potential PHI for number conversions due to type instability.</li>
565<li>Do not eliminate PHIs only referenced from other PHIs.</li>
566<li>Correctly anchor implicit number to string conversions in Lua/C API.</li>
567<li>Fix various stack limit checks.</li>
568<li>x64: Use thread-safe exceptions for external unwinding (GCC platforms).</li>
569<li>x64: Fix result type of cdata index conversions.</li>
570<li>x64: Fix <tt>math.random()</tt> and <tt>bit.bswap()</tt> code generation.</li>
571<li>x64: Fix <tt>lightuserdata</tt> comparisons.</li>
572<li>x64: Always extend stack-passed arguments to pointer size.</li>
573<li>ARM: Many fixes to code generation backend.</li>
574<li>PPC/e500: Fix dispatch for binop metamethods.</li>
575<li>PPC/e500: Save/restore condition registers when entering/leaving the VM.</li>
576<li>PPC/e500: Fix write barrier in stores of strings to upvalues.</li>
577</ul></li>
578<li>FFI library:
579<ul>
580<li>Fix C comment parsing.</li>
581<li>Fix snapshot optimization for cdata comparisons.</li>
582<li>Fix recording of const/enum lookups in namespaces.</li>
583<li>Fix call argument and return handling for <tt>I8/U8/I16/U16</tt> types.</li>
584<li>Fix unfused loads of float fields.</li>
585<li>Fix <tt>ffi.string()</tt> recording.</li>
586<li>Save <tt>GetLastError()</tt> around <tt>ffi.load()</tt> and symbol
587resolving, too.</li>
588<li>Improve ld script detection in <tt>ffi.load()</tt>.</li>
589<li>Record loads/stores to external variables in namespaces.</li>
590<li>Compile calls to stdcall, fastcall and vararg functions.</li>
591<li>Treat function ctypes like pointers in comparisons.</li>
592<li>Resolve <tt>__call</tt> metamethod for pointers, too.</li>
593<li>Record C function calls with bool return values.</li>
594<li>Record <tt>ffi.errno()</tt>.</li>
595<li>x86: Fix number to <tt>uint32_t</tt> conversion rounding.</li>
596<li>x86: Fix 64 bit arithmetic in assembler backend.</li>
597<li>x64: Fix struct-by-value calling conventions.</li>
598<li>ARM: Ensure invocation of SPLIT pass for float conversions.</li>
599</ul></li>
600<li>Structural and performance enhancements:
601<ul>
602<li>Display trace types with <tt>-jv</tt> and <tt>-jdump</tt>.</li>
603<li>Record isolated calls. But prefer recording loops over calls.</li>
604<li>Specialize to prototype for non-monomorphic functions. Solves the
605trace-explosion problem for closure-heavy programming styles.</li>
606<li>Always generate a portable <tt>vmdef.lua</tt>. Easier for distros.</li>
607</ul></li>
608</ul>
609
610<h2 id="LuaJIT-2.0.0-beta8">LuaJIT 2.0.0-beta8 &mdash; 2011-06-23</h2>
611<ul>
612<li>New features:
613<ul>
614<li>Soft-float ARM port of LuaJIT is complete.</li>
615<li>Add support for bytecode loading/saving and <tt>-b</tt> command line
616option.</li>
617<li>From Lua 5.2: <tt>__len</tt> metamethod for tables
618(disabled by default).</li>
619</ul></li>
620<li>Correctness and completeness:
621<ul>
622<li>ARM: Misc. fixes for interpreter.</li>
623<li>x86/x64: Fix <tt>bit.*</tt> argument checking in interpreter.</li>
624<li>Catch early out-of-memory in memory allocator initialization.</li>
625<li>Fix data-flow analysis for paths leading to an upvalue close.</li>
626<li>Fix check for missing arguments in <tt>string.format()</tt>.</li>
627<li>Fix Solaris/x86 build (note: not a supported target).</li>
628<li>Fix recording of loops with instable directions in side traces.</li>
629<li>x86/x64: Fix fusion of comparisons with <tt>u8</tt>/<tt>u16</tt>
630<tt>XLOAD</tt>.</li>
631<li>x86/x64: Fix register allocation for variable shifts.</li>
632</ul></li>
633<li>FFI library:
634<ul>
635<li>Add <tt>ffi.errno()</tt>. Save <tt>errno</tt>/<tt>GetLastError()</tt>
636around allocations etc.</li>
637<li>Fix <tt>__gc</tt> for VLA/VLS cdata objects.</li>
638<li>Fix recording of casts from 32 bit cdata pointers to integers.</li>
639<li><tt>tonumber(cdata)</tt> returns <tt>nil</tt> for non-numbers.</li>
640<li>Show address pointed to for <tt>tostring(pointer)</tt>.</li>
641<li>Print <tt>NULL</tt> pointers as <tt>"cdata&lt;... *&gt;: NULL"</tt>.</li>
642<li>Support <tt>__tostring</tt> metamethod for pointers to structs, too.</li>
643</ul></li>
644<li>Structural and performance enhancements:
645<ul>
646<li>More tuning for loop unrolling heuristics.</li>
647<li>Flatten and compress in-memory debug info (saves ~70%).</li>
648</ul></li>
649</ul>
650
651<h2 id="LuaJIT-2.0.0-beta7">LuaJIT 2.0.0-beta7 &mdash; 2011-05-05</h2>
652<ul>
653<li>New features:
654<ul>
655<li>ARM port of the LuaJIT interpreter is complete.</li>
656<li>FFI library: Add <tt>ffi.gc()</tt>, <tt>ffi.metatype()</tt>,
657<tt>ffi.istype()</tt>.</li>
658<li>FFI library: Resolve ld script redirection in <tt>ffi.load()</tt>.</li>
659<li>From Lua 5.2: <tt>package.searchpath()</tt>, <tt>fp:read("*L")</tt>,
660<tt>load(string)</tt>.</li>
661<li>From Lua 5.2, disabled by default: empty statement,
662<tt>table.unpack()</tt>, modified <tt>coroutine.running()</tt>.</li>
663</ul></li>
664<li>Correctness and completeness:
665<ul>
666<li>FFI library: numerous fixes.</li>
667<li>Fix type mismatches in store-to-load forwarding.</li>
668<li>Fix error handling within metamethods.</li>
669<li>Fix <tt>table.maxn()</tt>.</li>
670<li>Improve accuracy of <tt>x^-k</tt> on x64.</li>
671<li>Fix code generation for Intel Atom in x64 mode.</li>
672<li>Fix narrowing of POW.</li>
673<li>Fix recording of retried fast functions.</li>
674<li>Fix code generation for <tt>bit.bnot()</tt> and multiplies.</li>
675<li>Fix error location within cpcall frames.</li>
676<li>Add workaround for old libgcc unwind bug.</li>
677<li>Fix <tt>lua_yield()</tt> and <tt>getmetatable(lightuserdata)</tt> on x64.</li>
678<li>Misc. fixes for PPC/e500 interpreter.</li>
679<li>Fix stack slot updates for down-recursion.</li>
680</ul></li>
681<li>Structural and performance enhancements:
682<ul>
683<li>Add dual-number mode (int/double) for the VM. Enabled for ARM.</li>
684<li>Improve narrowing of arithmetic operators and <tt>for</tt> loops.</li>
685<li>Tune loop unrolling heuristics and increase trace recorder limits.</li>
686<li>Eliminate dead slots in snapshots using bytecode data-flow analysis.</li>
687<li>Avoid phantom stores to proxy tables.</li>
688<li>Optimize lookups in empty proxy tables.</li>
689<li>Improve bytecode optimization of <tt>and</tt>/<tt>or</tt> operators.</li>
690</ul></li>
691</ul>
692
693<h2 id="LuaJIT-2.0.0-beta6">LuaJIT 2.0.0-beta6 &mdash; 2011-02-11</h2>
694<ul>
695<li>New features:
696<ul>
697<li>PowerPC/e500v2 port of the LuaJIT interpreter is complete.</li>
698<li>Various minor features from Lua 5.2: Hex escapes in literals,
699<tt>'\*'</tt> escape, reversible <tt>string.format("%q",s)</tt>,
700<tt>"%g"</tt> pattern, <tt>table.sort</tt> checks callbacks,
701<tt>os.exit(status|true|false[,close])</tt>.</li>
702<li>Lua 5.2 <tt>__pairs</tt> and <tt>__ipairs</tt> metamethods
703(disabled by default).</li>
704<li>Initial release of the FFI library.</li>
705</ul></li>
706<li>Correctness and completeness:
707<ul>
708<li>Fix <tt>string.format()</tt> for non-finite numbers.</li>
709<li>Fix memory leak when compiled to use the built-in allocator.</li>
710<li>x86/x64: Fix unnecessary resize in <tt>TSETM</tt> bytecode.</li>
711<li>Fix various GC issues with traces and <tt>jit.flush()</tt>.</li>
712<li>x64: Fix fusion of indexes for array references.</li>
713<li>x86/x64: Fix stack overflow handling for coroutine results.</li>
714<li>Enable low-2GB memory allocation on FreeBSD/x64.</li>
715<li>Fix <tt>collectgarbage("count")</tt> result if more than 2GB is in use.</li>
716<li>Fix parsing of hex floats.</li>
717<li>x86/x64: Fix loop branch inversion with trailing
718<tt>HREF+NE/EQ</tt>.</li>
719<li>Add <tt>jit.os</tt> string.</li>
720<li><tt>coroutine.create()</tt> permits running C functions, too.</li>
721<li>Fix OSX build to work with newer ld64 versions.</li>
722<li>Fix bytecode optimization of <tt>and</tt>/<tt>or</tt> operators.</li>
723</ul></li>
724<li>Structural and performance enhancements:
725<ul>
726<li>Emit specialized bytecode for <tt>pairs()</tt>/<tt>next()</tt>.</li>
727<li>Improve bytecode coalescing of <tt>nil</tt> constants.</li>
728<li>Compile calls to vararg functions.</li>
729<li>Compile <tt>select()</tt>.</li>
730<li>Improve alias analysis, esp. for loads from allocations.</li>
731<li>Tuning of various compiler heuristics.</li>
732<li>Refactor and extend IR conversion instructions.</li>
733<li>x86/x64: Various backend enhancements related to the FFI.</li>
734<li>Add SPLIT pass to split 64 bit IR instructions for 32 bit CPUs.</li>
735</ul></li>
736</ul>
737
738<h2 id="LuaJIT-2.0.0-beta5">LuaJIT 2.0.0-beta5 &mdash; 2010-08-24</h2>
739<ul>
740<li>Correctness and completeness:
741<ul>
742<li>Fix trace exit dispatch to function headers.</li>
743<li>Fix Windows and OSX builds with LUAJIT_DISABLE_JIT.</li>
744<li>Reorganize and fix placement of generated machine code on x64.</li>
745<li>Fix TNEW in x64 interpreter.</li>
746<li>Do not eliminate PHIs for values only referenced from side exits.</li>
747<li>OS-independent canonicalization of strings for non-finite numbers.</li>
748<li>Fix <tt>string.char()</tt> range check on x64.</li>
749<li>Fix <tt>tostring()</tt> resolving within <tt>print()</tt>.</li>
750<li>Fix error handling for <tt>next()</tt>.</li>
751<li>Fix passing of constant arguments to external calls on x64.</li>
752<li>Fix interpreter argument check for two-argument SSE math functions.</li>
753<li>Fix C frame chain corruption caused by <tt>lua_cpcall()</tt>.</li>
754<li>Fix return from <tt>pcall()</tt> within active hook.</li>
755</ul></li>
756<li>Structural and performance enhancements:
757<ul>
758<li>Replace on-trace GC frame syncing with interpreter exit.</li>
759<li>Improve hash lookup specialization by not removing dead keys during GC.</li>
760<li>Turn traces into true GC objects.</li>
761<li>Avoid starting a GC cycle immediately after library init.</li>
762<li>Add weak guards to improve dead-code elimination.</li>
763<li>Speed up string interning.</li>
764</ul></li>
765</ul>
766
767<h2 id="LuaJIT-2.0.0-beta4">LuaJIT 2.0.0-beta4 &mdash; 2010-03-28</h2>
768<ul>
769<li>Correctness and completeness:
770<ul>
771<li>Fix precondition for on-trace creation of table keys.</li>
772<li>Fix <tt>{f()}</tt> on x64 when table is resized.</li>
773<li>Fix folding of ordered comparisons with same references.</li>
774<li>Fix snapshot restores for multi-result bytecodes.</li>
775<li>Fix potential hang when recording bytecode with nested closures.</li>
776<li>Fix recording of <tt>getmetatable()</tt>, <tt>tonumber()</tt> and bad argument types.</li>
777<li>Fix SLOAD fusion across returns to lower frames.</li>
778</ul></li>
779<li>Structural and performance enhancements:
780<ul>
781<li>Add array bounds check elimination. <tt>-Oabc</tt> is enabled by default.</li>
782<li>More tuning for x64, e.g. smaller table objects.</li>
783</ul></li>
784</ul>
785
786<h2 id="LuaJIT-2.0.0-beta3">LuaJIT 2.0.0-beta3 &mdash; 2010-03-07</h2>
787<ul>
788<li>LuaJIT x64 port:
789<ul>
790<li>Port integrated memory allocator to Linux/x64, Windows/x64 and OSX/x64.</li>
791<li>Port interpreter and JIT compiler to x64.</li>
792<li>Port DynASM to x64.</li>
793<li>Many 32/64 bit cleanups in the VM.</li>
794<li>Allow building the interpreter with either x87 or SSE2 arithmetics.</li>
795<li>Add external unwinding and C++ exception interop (default on x64).</li>
796</ul></li>
797<li>Correctness and completeness:
798<ul>
799<li>Fix constructor bytecode generation for certain conditional values.</li>
800<li>Fix some cases of ordered string comparisons.</li>
801<li>Fix <tt>lua_tocfunction()</tt>.</li>
802<li>Fix cutoff register in JMP bytecode for some conditional expressions.</li>
803<li>Fix PHI marking algorithm for references from variant slots.</li>
804<li>Fix <tt>package.cpath</tt> for non-default PREFIX.</li>
805<li>Fix DWARF2 frame unwind information for interpreter on OSX.</li>
806<li>Drive the GC forward on string allocations in the parser.</li>
807<li>Implement call/return hooks (zero-cost if disabled).</li>
808<li>Implement yield from C hooks.</li>
809<li>Disable JIT compiler on older non-SSE2 CPUs instead of aborting.</li>
810</ul></li>
811<li>Structural and performance enhancements:
812<ul>
813<li>Compile recursive code (tail-, up- and down-recursion).</li>
814<li>Improve heuristics for bytecode penalties and blacklisting.</li>
815<li>Split CALL/FUNC recording and clean up fast function call semantics.</li>
816<li>Major redesign of internal function call handling.</li>
817<li>Improve FOR loop const specialization and integerness checks.</li>
818<li>Switch to pre-initialized stacks. Avoid frame-clearing.</li>
819<li>Colocation of prototypes and related data: bytecode, constants, debug info.</li>
820<li>Cleanup parser and streamline bytecode generation.</li>
821<li>Add support for weak IR references to register allocator.</li>
822<li>Switch to compressed, extensible snapshots.</li>
823<li>Compile returns to frames below the start frame.</li>
824<li>Improve alias analysis of upvalues using a disambiguation hash value.</li>
825<li>Compile floor/ceil/trunc to SSE2 helper calls or SSE4.1 instructions.</li>
826<li>Add generic C call handling to IR and backend.</li>
827<li>Improve KNUM fuse vs. load heuristics.</li>
828<li>Compile various <tt>io.*()</tt> functions.</li>
829<li>Compile <tt>math.sinh()</tt>, <tt>math.cosh()</tt>, <tt>math.tanh()</tt>
830and <tt>math.random()</tt>.</li>
831</ul></li>
832</ul>
833
834<h2 id="LuaJIT-2.0.0-beta2">LuaJIT 2.0.0-beta2 &mdash; 2009-11-09</h2>
835<ul>
836<li>Reorganize build system. Build static+shared library on POSIX.</li>
837<li>Allow C++ exception conversion on all platforms
838using a wrapper function.</li>
839<li>Automatically catch C++ exceptions and rethrow Lua error
840(DWARF2 only).</li>
841<li>Check for the correct x87 FPU precision at strategic points.</li>
842<li>Always use wrappers for libm functions.</li>
843<li>Resurrect metamethod name strings before copying them.</li>
844<li>Mark current trace, even if compiler is idle.</li>
845<li>Ensure FILE metatable is created only once.</li>
846<li>Fix type comparisons when different integer types are involved.</li>
847<li>Fix <tt>getmetatable()</tt> recording.</li>
848<li>Fix TDUP with dead keys in template table.</li>
849<li><tt>jit.flush(tr)</tt> returns status.
850Prevent manual flush of a trace that's still linked.</li>
851<li>Improve register allocation heuristics for invariant references.</li>
852<li>Compile the push/pop variants of <tt>table.insert()</tt> and
853<tt>table.remove()</tt>.</li>
854<li>Compatibility with MSVC <tt>link&nbsp/debug</tt>.</li>
855<li>Fix <tt>lua_iscfunction()</tt>.</li>
856<li>Fix <tt>math.random()</tt> when compiled with <tt>-fpic</tt> (OSX).</li>
857<li>Fix <tt>table.maxn()</tt>.</li>
858<li>Bump <tt>MACOSX_DEPLOYMENT_TARGET</tt> to <tt>10.4</tt></li>
859<li><tt>luaL_check*()</tt> and <tt>luaL_opt*()</tt> now support
860negative arguments, too.<br>
861This matches the behavior of Lua 5.1, but not the specification.</li>
862</ul>
863
864<h2 id="LuaJIT-2.0.0-beta1">LuaJIT 2.0.0-beta1 &mdash; 2009-10-31</h2>
865<ul>
866<li>This is the first public release of LuaJIT 2.0.</li>
867<li>The whole VM has been rewritten from the ground up, so there's
868no point in listing differences over earlier versions.</li>
869</ul>
870</div>
871<br class="flush">
872</div>
873<div id="foot">
874<hr class="hide">
875Copyright &copy; 2005-2020
876<span class="noprint">
877&middot;
878<a href="contact.html">Contact</a>
879</span>
880</div>
881</body>
882</html>