You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* cbindlist
* add cbind by reference, timing
* R prototype of mergelist
* wording
* use lower overhead funs
* stick to int32 for now, correct R_alloc
* bmerge C refactor for codecov and one loop for speed
* address revealed codecov gaps
* refactor vecseq for codecov
* seqexp helper, some alloccol export on C
* bmerge codecov, types handled in R bmerge already
* better comment seqexp
* bmerge mult=error #655
* multiple new C utils
* swap if branches
* explain new C utils
* comments mostly
* reduce conflicts to PR #4386
* comment C code
* address multiple matches during update-on-join #3747
* Revert "address multiple matches during update-on-join #3747"
This reverts commit b64c0c3.
* merge.dt has temporarily mult arg, for testing
* minor changes to cbindlist c
* dev mergelist, for single pair now
* add quiet option to cc()
* mergelist tests
* add check for names to perhaps.dt
* rm mult from merge.dt method
* rework, clean, polish multer, fix righ and full joins
* make full join symmetric
* mergepair inner function to loop on
* extra check for symmetric
* mergelist manual
* ensure no df-dt passed where list expected
* comments and manual
* handle 0 cols tables
* more tests
* more tests and debugging
* move more logic closer to bmerge, simplify mergepair
* more tests
* revert not used changes
* reduce not needed checks, cleanup
* copy arg behavior, manual, no tests yet
* cbindlist manual, export both
* cleanup processing bmerge to dtmatch
* test function match order for easier preview
* vecseq gets short-circuit
* batch test allow browser
* big cleanup
* remmove unneeded stuff, reduce diff
* more cleanup, minor manual fixes
* add proper test scripts
* comment out not used code for coverage
* more tests, some nocopy opts
* rename sql test script, should fix codecov
* simplify dtmatch inner branch
* more precise copy, now copy only T or F
* unused arg not yet in api, wording
* comments and refer issues
* codecov
* hasindex coverage
* codecov gap
* tests for join using key, cols argument
* fix missing import forderv
* more tests, improve missing on handling
* more tests for order of inner and full join for long keys
* new allow.cartesian option, #4383, #914
* reduce diff, improve codecov
* reduce diff, comments
* need more DT, not lists, mergelist 3+ tbls
* proper escape heavy check
* unit tests
* more tests, address overalloc failure
* mergelist and cbindlist retain index
* manual, examples
* fix manual
* minor clarify in manual
* retain keys, right outer join for snowflake schema joins
* duplicates in cbindlist
* recycling in cbindlist
* escape 0 input in copyCols
* empty input handling
* closing cbindlist
* vectorized _on_ and _join.many_ arg
* rename dtmatch to dtmerge
* vectorized args: how, mult
push down input validation
add support for cross join, semi join, anti join
* full join, reduce overhead for mult=error
* mult default value dynamic
* fix manual
* add "see details" to Rd
* mention shared on in arg description
* amend feedback from Michael
* semi and anti joins will not reorder x columns
* spelling, thx to @jan-glx
* check all new funs used and add comments
* bugfix, sort=T needed for now
* Update NEWS.md
* NEWS placement
* numbering
* ascArg->order
* attempt to restore from master
* Update to stopf() error style
* Need isFrame for now
* More quality checks: any(!x)->!all(x); use vapply_1{b,c,i}
* really restore from master
* try to PROTECT() before duplicate()
* update error message in test
* appease the rchk gods
* extraneous space
* missing ';'
* use catf
* simplify perhapsDataTableR
* move sqlite.Rraw.manual into other.Rraw
* simplify for loop
* first pass at publishable NEWS
* ws
* failed merge
* failed merge pt ii
* shrink diff
* pass at style
* Ditch mergelist(copy=) for setmergelist
* Put cols=NULL default into the signature to avoid missing() quirks
* Explain 'NULL' in cols= in Rd
* First pass on grammar for \arguments
* finish style+grammar pass
* restore 'join.many' to signature
* use 'try' for known error in example
* tweak examples
* Add \references for star/snowflake schema terminology
* fix test error messages, remove extra '[]' from brackify errors
* rm unreachable error
* coverage
* first pass at local() style tests
* linted style
* semicolons, spacing
* rearrange tests using options to be in nested local() calls
* restore new 'l' for rearranged tests; re-capture test using 'l' in local()
* Jan's clarifying comment
* Another pass at style, annotation; remove some duplicate tests
* more refinement of test structure, comments
* finished mergelist.Rraw
* more whitespace in constructed SQL queries
* style, continued
* more formal styling with lintr
* update reference to other.Rraw tests
* return output invisibly for set* functions
* mention setmergelist in NEWS
* numbering
---------
Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>
Co-authored-by: Michael Chirico <chiricom@google.com>
Copy file name to clipboardExpand all lines: NEWS.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -48,6 +48,10 @@
48
48
49
49
11.New`frev(x)`asafasteranalogueto`base::rev()`foratomicvectors/lists, [#5885](https://github.com/Rdatatable/data.table/issues/5885). Twice as fast as `base::rev()` on large inputs, and faster with more threads. Thanks to Benjamin Schwendinger for suggesting and implementing.
50
50
51
+
12.New`cbindlist()`and`setcbindlist()`forconcatenatinga`list`ofdata.tablescolumn-wise, evocativeoftheanalogous`do.call(rbind, l)`<->`rbindlist(l)`, [#2576](https://github.com/Rdatatable/data.table/issues/2576). `setcbindlist()` does so without making any copies. Thanks @MichaelChirico for the FR, @jangorecki for the PR, and @MichaelChirico for extensive reviews and fine-tuning.
52
+
53
+
13.New`mergelist()`and`setmergelist()`similarlywork_ala_`Reduce()`torecursivelymergea`list`ofdata.tables, [#599](https://github.com/Rdatatable/data.table/issues/599). Different join modes (_left_, _inner_, _full_, _right_, _semi_, _anti_, and _cross_) are supported through the `how` argument; duplicate handling goes through the `mult` argument. `setmergelist()` carefully avoids copies where one is not needed, e.g. in a 1:1 left join. Thanks Patrick Nicholson for the FR (in 2013!), @jangorecki for the PR, and @MichaelChirico for extensive reviews and fine-tuning.
54
+
51
55
### BUG FIXES
52
56
53
57
1.`fread()`nolongerwarnsoncertainsystemsonR4.5.0+wherethefileownercan't be resolved, [#6918](https://github.com/Rdatatable/data.table/issues/6918). Thanks @ProfFancyPants for the report and PR.
0 commit comments